Abstract

The highest information rate at which quantum error-correction schemes work reliably on a channel, which is called the quantum capacity, is proven to be lower bounded by the limit of the quantity termed coherent information maximized over the set of input density operators which are proportional to the projections onto the code spaces of symplectic stabilizer codes. Quantum channels to be considered are those subject to independent errors and modeled as tensor products of copies of a completely positive linear map on a Hilbert space of finite dimension, and the codes that are proven to have the desired performance are symplectic stabilizer codes. On the depolarizing channel, this work's bound is actually the highest possible rate at which symplectic stabilizer codes work reliably.
Index Terms: Completely positive (CP) linear maps, fidelity, symplectic geometry, the method of types, quantum capacity, quantum error-correcting codes.

I. Introduction

The problem of determining the capacity of quantum channels was posed by Shor [1] in the first paper on quantum error-correcting codes (quantum codes, or codes, hereafter). He discussed it in the context of preservation of quantum states, which are to be used for quantum computation in the presence of quantum noise. There is a known upper bound on the quantum capacity based on the quantity called coherent information, and some authors conjecture that this bound is tight. On the other hand, known lower bounds appear to have left much room for improvement.

For example, on the capacity of the depolarizing channel, which suffers uniform depolarization and can be specified by Kraus operators $\sqrt{1-p}\,I$, $\sqrt{p/3}\,X$, $\sqrt{p/3}\,Y$, $\sqrt{p/3}\,Z$ with $I$ and $X, Y, Z$ being the identity and Pauli operators, respectively, the highest lower bound known is $1 - h(p) - p\log_2 3$ for a wide range of $p$, where $h$ is the binary entropy function. Shor and Smolin [12] argued that this bound is not tight by showing the existence of concatenated quantum codes that go slightly beyond it for a limited range of $p$, which revealed a remarkable feature of the issue of the quantum capacity. While their work and the subsequent analysis of DiVincenzo and these authors [13] abounded with suggestions, their code construction was apparently restricted, and explorations into the general nature behind their code construction and further analysis were awaited [13].
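As a small numerical illustration of the bound $1 - h(p) - p\log_2 3$ quoted above, the following sketch evaluates it and locates, by bisection, the value of $p$ at which it reaches zero. The function names and the bisection interval are choices made for this illustration only and do not come from the paper.

```python
import math

def binary_entropy(p):
    """Binary entropy h(p) in bits, with h(0) = h(1) = 0."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def hashing_bound(p):
    """The lower bound 1 - h(p) - p*log2(3) for the depolarizing channel."""
    return 1.0 - binary_entropy(p) - p * math.log2(3.0)

def zero_crossing(lo=0.0, hi=0.25, iters=60):
    """Bisection for the p where the bound hits zero.

    Assumes (illustrative choice) that the bound is positive at lo and
    negative at hi, which holds for the default interval.
    """
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if hashing_bound(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

Any code that achieves a positive rate beyond this crossing point, as the concatenated codes of Shor and Smolin do, shows that the bound is not tight.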

The aim of this work is to give a more general lower bound which partially closes the gap between the upper and lower bounds, at least qualitatively. The bound to be presented is expressed as the limit of coherent information maximized over the set of input density operators which are proportional to the projections onto the code spaces of standard algebraic quantum codes. This limit closely resembles the known upper bound on the capacity, which is the one defined in the same way but with the restriction on input density operators removed. The result is obtained by developing Shor and Smolin's idea on the basis of the geometric property of quantum codes, and incorporating a methodology from classical information theory. In other words, this work establishes an exponential lower bound on the highest fidelity of concatenated quantum codes used on a memoryless channel in an elementary enumerative manner employing the method of types. This fidelity bound then gives the new lower bound on the quantum capacity of memoryless channels. Unlike Shor and Smolin's or DiVincenzo and these authors' coding schemes [12], [13], the codes in this work do not rely on purification protocols and fall in the class of standard 'in-place' quantum codes called stabilizer codes, which is desirable for the simplicity of the coding processes and for the possibility of use in quantum computation. Moreover, for the depolarizing channel, which has often been adopted as a channel model for analysis of quantum codes, it will be shown that this bound on the capacity is the highest possible that can be attained with standard quantum codes.

Concatenated quantum codes are, in a sense, analogous to classical concatenated codes and form a subclass of the class of standard algebraic quantum codes, which are called stabilizer, additive or symplectic codes in the literature [19], [10]. While the term stabilizer codes is prevailing, these codes will rather be called symplectic (quantum) codes or symplectic stabilizer codes, with emphasis on the role of symplectic geometry in this work. A symplectic quantum code is a simultaneous eigenspace of a set of commuting operators, which is called a stabilizer. A stabilizer is obtained by constructing a code over a finite field which is self-orthogonal with respect to a symplectic bilinear form and then transforming it into operators on a Hilbert space through a one-to-one correspondence (a projective representation). A stabilizer of a concatenated quantum code, which will simply be called a concatenated code in what follows, is obtained by concatenating two such self-orthogonal codes and putting the result through the representation. We refer to these two codes, or the corresponding quantum codes, as an inner code and an outer one, following Forney's terminology. Shor and Smolin's concatenated code uses an inner code with restricted parameters. Namely, their inner code is an $[[n, k = 1]]$ code, where an $[[n, k]]$ code is a $2^k$-dimensional subspace of the tensor product of $n$ copies of a two-dimensional Hilbert space. This paper develops DiVincenzo, Shor and Smolin's analysis [12], [13] to include that of concatenated codes with general inner $[[n, k]]$ codes with $1 \le k \le n$.



Many authors have contributed to the still ongoing development of the theory of quantum channel coding, and good surveys on these problems are available in the literature. A number of further contributions have appeared after these surveys. Especially, we have witnessed the determination of the entanglement-assisted capacity and the settlement of the additivity problem of the classical capacity for several classes of channels, though these are not the capacities with which this paper is concerned. Nor will it discuss continuous channel models such as quantum Gaussian channels.
The argument below proceeds as follows. After stating the result (Section II), we first recapitulate the framework of symplectic codes in a self-contained manner, assuming no formidable prerequisite such as knowledge of representation theory, though a few basic facts from geometric algebra [33] are used (Section III). Then, concatenated codes are explicated in this framework (Section IV), and the lower bound on the capacity is proven in an elementary manner with the aid of the method of types (Section V and the appendices). Thereafter, it is shown that this bound is the highest possible on the depolarizing channel if we restrict the coding schemes to symplectic stabilizer codes (Section VI). Finally, some remarks are given on the case of general channels and so on (Sections VII and VIII). Appendices are given to prove lemmas and a theorem.

II. Quantum Capacity and New Lower Bound 

As usual, all quantum channels and decoding (state-recovery) operations in coding systems are described in terms of trace-preserving completely positive (TPCP) linear maps. Given a Hilbert space $\mathsf{H}$ of finite dimension, let $\mathsf{L}(\mathsf{H})$ denote the set of linear operators on $\mathsf{H}$. In general, every completely positive (CP) linear map $\mathcal{M} : \mathsf{L}(\mathsf{H}) \to \mathsf{L}(\mathsf{H})$ has an operator-sum representation $\mathcal{M}(\rho) = \sum_{i \in \mathcal{X}} M_i \rho M_i^\dagger$ with some $M_i \in \mathsf{L}(\mathsf{H})$, $i \in \mathcal{X}$. When $\mathcal{M}$ is specified by a set of operators $\{M_i\}_{i \in \mathcal{X}}$ in this way, we write $\mathcal{M} \sim \{M_i\}_{i \in \mathcal{X}}$.

Here is a word about notation for ordered sets. Though sets of the form $\{x_i\}_{i \in \mathcal{X}}$ represent ordered ones with arbitrarily fixed orderings in principle, the set operation $\cup$ will sometimes be applied to these, as in $\{x_i\}_{i \in \mathcal{X}} \cup \{y_i\}_{i \in \mathcal{X}'}$, if the order really does not matter, as in operator-sum representations.
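Hereafter, $\mathsf{H}$ denotes an arbitrarily fixed Hilbert space of dimension $d$, which is a prime number. A quantum memoryless channel is a TPCP linear map $\mathcal{A} : \mathsf{L}(\mathsf{H}) \to \mathsf{L}(\mathsf{H})$. A memoryless channel $\mathcal{A}$ is supposed to act on a state or a density operator $\rho$ in $\mathsf{L}(\mathsf{H}^{\otimes n})$ as $\mathcal{A}^{\otimes n}(\rho)$. A pair $(\mathcal{C}_n, \mathcal{R}_n)$ consisting of a subspace $\mathcal{C}_n \subseteq \mathsf{H}^{\otimes n}$ and a TPCP linear map $\mathcal{R}_n : \mathsf{L}(\mathsf{H}^{\otimes n}) \to \mathsf{L}(\mathsf{H}^{\otimes n})$, which is supposed to serve as a recovery operator, is called a (quantum) code. Its information rate (or simply rate) is defined to be $n^{-1} \log_d \dim \mathcal{C}_n$, and its performance is evaluated in terms of minimum fidelity

$$ F(\mathcal{C}_n, \mathcal{R}_n \mathcal{A}^{\otimes n}) = \min_{|\psi\rangle \in \mathcal{C}_n} \langle\psi| \mathcal{R}_n \mathcal{A}^{\otimes n}(|\psi\rangle\langle\psi|) |\psi\rangle, \tag{1} $$

where $\mathcal{R}_n \mathcal{A}^{\otimes n}$ denotes the composition of $\mathcal{A}^{\otimes n}$ and $\mathcal{R}_n$. Throughout, bras $\langle\cdot|$ and kets $|\cdot\rangle$ are assumed normalized. A subspace $\mathcal{C}_n$ alone is also called a code, assuming implicitly some recovery operator.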

For simplicity, we will work on a special class of channels that are specified as follows, though the lower bound to be presented is applicable to general channels (Section VII). Fix an orthonormal basis $\{|0\rangle, \ldots, |d-1\rangle\}$ of $\mathsf{H}$. Put $\mathcal{X} = \{0, \ldots, d-1\}^2$ and

$$ N_{(i,j)} = X^i Z^j, \quad (i,j) \in \mathcal{X}, \tag{2} $$

where $X, Z \in \mathsf{L}(\mathsf{H})$ are Weyl's unitary operators defined by

$$ X|j\rangle = |(j-1) \bmod d\rangle, \quad Z|j\rangle = \omega^j |j\rangle \tag{3} $$

with $\omega$ being a primitive $d$-th root of unity. Observe the relation

$$ XZ = \omega ZX. \tag{4} $$
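The definitions (2) and (3) and the relation (4) can be checked numerically. The sketch below is illustrative only: it assumes the particular primitive root $\omega = e^{2\pi i/d}$ (any primitive $d$-th root would do), and the function names are hypothetical.

```python
import numpy as np

def weyl_operators(d):
    """Return (X, Z, omega) with X|j> = |(j-1) mod d> and Z|j> = omega^j |j>."""
    omega = np.exp(2j * np.pi / d)      # assumed choice of primitive d-th root
    X = np.zeros((d, d), dtype=complex)
    for j in range(d):
        X[(j - 1) % d, j] = 1.0         # column j holds the image of |j>
    Z = np.diag(omega ** np.arange(d))
    return X, Z, omega

def weyl_basis(d):
    """Dictionary {(i, j): X^i Z^j} indexed by the pairs in {0, ..., d-1}^2."""
    X, Z, _ = weyl_operators(d)
    return {(i, j): np.linalg.matrix_power(X, i) @ np.linalg.matrix_power(Z, j)
            for i in range(d) for j in range(d)}
```

For instance, `X, Z, omega = weyl_operators(3)` followed by `np.allclose(X @ Z, omega * Z @ X)` confirms (4) for $d = 3$.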

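The set $N = \{N_u\}_{u \in \mathcal{X}}$ is a basis of $\mathsf{L}(\mathsf{H})$ and could be viewed as a generalization of the Pauli operators (including the identity). We will treat channels that can be written as $\mathcal{A} \sim \{\sqrt{P(u)}\, N_u\}_{u \in \mathcal{X}}$, which will be called Pauli channels or $N$-channels, where $P$ is a probability distribution on $\mathcal{X}$. From the basis $\{N_{(i,j)}\}$ of $\mathsf{L}(\mathsf{H})$, we obtain a basis $N_n = \{N_x\}_{x \in \mathcal{X}^n}$ of $\mathsf{L}(\mathsf{H}^{\otimes n})$, where $N_x = N_{x_1} \otimes \cdots \otimes N_{x_n}$ for $x = (x_1, \ldots, x_n) \in \mathcal{X}^n$. With this notation, we can write $\mathcal{A}^{\otimes n}(\rho) = \sum_{x \in \mathcal{X}^n} P^n(x) N_x \rho N_x^\dagger$ for $\mathcal{A} \sim \{\sqrt{P(u)}\, N_u\}_{u \in \mathcal{X}}$, where for a probability distribution $Q$ on a finite set $\mathcal{Y}$, the product measure $Q^n$ is defined by $Q^n(y) = \prod_{i=1}^n Q(y_i)$, $y = (y_1, \ldots, y_n) \in \mathcal{Y}^n$. Since $\mathcal{A}^{\otimes n}(\rho) = \sum_{x \in \mathcal{X}^n} P^n(x) N_x \rho N_x^\dagger$ can be viewed as the probabilistic mixture of states $N_x \rho N_x^\dagger$ with probabilities $P^n(x)$, we often say that an error $N_x$ occurs with probability $P^n(x)$ when describing the action of the Pauli channel $\mathcal{A}$.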
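The mixture form of $\mathcal{A}^{\otimes n}$ can be written out directly for small $n$. The following is a brute-force sketch, not an efficient simulator: it enumerates all $d^{2n}$ error indices, assumes $\omega = e^{2\pi i/d}$, and represents $P$ as a Python dictionary keyed by pairs $(i,j)$. All function names are hypothetical.

```python
import itertools
import numpy as np

def weyl_basis(d):
    """{(i, j): X^i Z^j} with X|j> = |(j-1) mod d>, Z|j> = omega^j |j>."""
    omega = np.exp(2j * np.pi / d)                      # assumed primitive root
    X = np.roll(np.eye(d, dtype=complex), -1, axis=0)   # X[j-1, j] = 1
    Z = np.diag(omega ** np.arange(d))
    return {(i, j): np.linalg.matrix_power(X, i) @ np.linalg.matrix_power(Z, j)
            for i in range(d) for j in range(d)}

def pauli_channel_n(rho, P, d, n):
    """Apply sum_x P^n(x) N_x rho N_x^dagger to a (d^n x d^n) matrix rho."""
    N = weyl_basis(d)
    out = np.zeros_like(rho, dtype=complex)
    for x in itertools.product(N.keys(), repeat=n):
        prob = float(np.prod([P[u] for u in x]))        # product measure P^n(x)
        if prob == 0.0:
            continue
        Nx = np.eye(1, dtype=complex)
        for u in x:
            Nx = np.kron(Nx, N[u])                      # N_x = N_{x_1} (x) ... (x) N_{x_n}
        out += prob * (Nx @ rho @ Nx.conj().T)
    return out
```

With $d = 2$ and $P$ uniform on $\mathcal{X}$, every input state is mapped to the maximally mixed state, which gives a quick sanity check of the construction.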

We can define the capacity of a quantum channel as in classical information theory.

Definition 1: Let $F^*_{n,k}(\mathcal{A}^{\otimes n})$ denote the supremum of $F(\mathcal{C}, \mathcal{R}\mathcal{A}^{\otimes n})$ such that there exists a code $(\mathcal{C} \subseteq \mathsf{H}^{\otimes n}, \mathcal{R})$ with $\log_d \dim \mathcal{C} \ge k$, where $n$ is a positive integer and $k$, $0 \le k \le n$, is a real number. The supremum of nonnegative numbers $R$ satisfying

$$ \limsup_{n \to \infty} F^*_{n,Rn}(\mathcal{A}^{\otimes n}) = 1 $$

is called the quantum capacity of $\mathcal{A}$ and denoted by $C(\mathcal{A})$. ◊

Remarks. The term quantum capacity is used when one needs to distinguish it from other capacities (such as classical capacity) of a quantum channel. Variations exist in definitions of capacity concepts. Especially, besides the definition of quantum capacity above, there exists a seemingly different one based on entanglement fidelity, but actually they are the same. In the above definition, the stipulation $\limsup_n F^*_{n,Rn}(\mathcal{A}^{\otimes n}) = 1$ may be replaced by $\liminf_n F^*_{n,Rn}(\mathcal{A}^{\otimes n}) = 1$; by slightly modifying the proof of Theorem 1 to be presented below, it easily follows that the main result holds true if the limit superior is replaced by the limit inferior. □

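Given a probability distribution $Q$ on $\mathcal{Y} \times \mathcal{Z}$, we define $\overline{Q}$ by

$$ \overline{Q}(u) = \sum_{v \in \mathcal{Z}} Q(u, v), \quad u \in \mathcal{Y}, $$

which is a marginal distribution of $Q$, and define $Q(\cdot|u)$ by

$$ Q(v|u) = Q(u, v)/\overline{Q}(u), \quad v \in \mathcal{Z}, $$

for $u \in \mathcal{Y}$ with $\overline{Q}(u) > 0$, while $Q(\cdot|u)$ is undefined for $\overline{Q}(u) = 0$. The classical Kullback-Leibler information (informational divergence or relative entropy) is denoted by $D$ and (conditional) Shannon entropy by $H$. Specifically, for probability distributions $P$ and $Q$ on a finite set $\mathcal{Y}$, we define $D(P\|Q)$ by $D(P\|Q) = \sum_{u \in \mathcal{Y}} P(u) \log_d [P(u)/Q(u)]$ and $H(Q)$ by $H(Q) = -\sum_{u \in \mathcal{Y}} Q(u) \log_d Q(u)$. In addition, for a stochastic matrix $P(\cdot|\cdot)$, i.e., a set of probability distributions $P(\cdot|u)$, $u \in \mathcal{Y}$, and a probability distribution $p$ on $\mathcal{Y}$, we define $H(P|p)$ by

$$ H(P|p) = -\sum_{u \in \mathcal{Y}} p(u) \sum_{v} P(v|u) \log_d P(v|u), $$

which is called the entropy of $P(\cdot|\cdot)$ conditional on $p$. By convention, we assume $\log(a/0) = \infty$ for $a > 0$ and $0 \log 0 = 0 \log(0/0) = 0$.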
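The entropic quantities just defined translate directly into code. The sketch below represents distributions as Python dictionaries (an illustrative choice, as are the function names) and follows the stated conventions for zero probabilities; logarithms are taken to base $d$.

```python
import math

def kl_divergence(P, Q, d):
    """D(P||Q) = sum_u P(u) log_d[P(u)/Q(u)], with 0 log(0/q) = 0 and log(a/0) = inf."""
    total = 0.0
    for u, pu in P.items():
        if pu == 0.0:
            continue
        qu = Q.get(u, 0.0)
        if qu == 0.0:
            return math.inf
        total += pu * math.log(pu / qu, d)
    return total

def entropy(Q, d):
    """Shannon entropy H(Q) = -sum_u Q(u) log_d Q(u), with 0 log 0 = 0."""
    return -sum(q * math.log(q, d) for q in Q.values() if q > 0.0)

def marginal(Q):
    """Marginal Q-bar(u) = sum_v Q(u, v) of a joint distribution keyed by (u, v)."""
    bar = {}
    for (u, _v), q in Q.items():
        bar[u] = bar.get(u, 0.0) + q
    return bar

def conditional_entropy(Q, d):
    """H(Q | Q-bar) = -sum_u Q-bar(u) sum_v Q(v|u) log_d Q(v|u) for a joint Q."""
    bar = marginal(Q)
    total = 0.0
    for (u, _v), q in Q.items():
        if q > 0.0:
            total -= q * math.log(q / bar[u], d)   # q = Q-bar(u) * Q(v|u)
    return total
```

A convenient check is the chain rule: `conditional_entropy(Q, d)` should equal `entropy(Q, d) - entropy(marginal(Q), d)` for any joint distribution `Q`.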

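For a density operator $\rho \in \mathsf{L}(\mathsf{H}')$ and a TPCP map $\mathcal{A}' : \mathsf{L}(\mathsf{H}') \to \mathsf{L}(\mathsf{H}')$, the coherent information $I_c(\rho, \mathcal{A}')$ is defined by

$$ I_c(\rho, \mathcal{A}') = S(\mathcal{A}'(\rho)) - S([\mathcal{I} \otimes \mathcal{A}'](|\Psi\rangle\langle\Psi|)), $$

where $S(\sigma)$ denotes the von Neumann entropy of $\sigma$, $\mathcal{I}$ is the identity map on $\mathsf{L}(\mathsf{H}')$, and $|\Psi\rangle \in \mathsf{H}' \otimes \mathsf{H}'$ is a purification of $\rho$. For consistency, we assume that all logarithms appearing in these entropic quantities are to base $d$ throughout the paper.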
This work's main result is the next one.

Theorem 1: Let the basis $N = \{N_u\}_{u \in \mathcal{X}} = \{X^i Z^j\}_{(i,j) \in \mathcal{X}}$ be specified as above. For a memoryless channel $\mathcal{A} \sim \{\sqrt{P(u)}\, N_u\}_{u \in \mathcal{X}}$, where $P$ is a probability distribution on $\mathcal{X}$, we have

$$ C(\mathcal{A}) \ge \sup_{n \ge 1} \max_{\mathcal{C} \in S_n(N)} \frac{I_c\big((\dim \mathcal{C})^{-1} \Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big)}{n}, $$

where $\Pi_{\mathcal{C}}$ is the projection onto $\mathcal{C}$ and $S_n(N)$ is the set of symplectic stabilizer codes, the precise definition of which will be given in Definition 2 in Section III-A. ◊

After proving this, we argue that this bound is actually the 'conditional' capacity of the depolarizing channel on symplectic codes, which indicates the supremum of information rates at which symplectic codes work reliably.

III. Codes Based on Symplectic Geometry 
A. Basics of Symplectic Stabilizer Codes 

In this section, the framework of symplectic codes is rebuilt on the theory of geometric algebra [33, Chapter III]. In designing symplectic codes, we use Weyl's unitary basis $N = \{N_u\}_{u \in \mathcal{X}}$, which has been specified by (2) and (3). We can regard the index of $N_{(i,j)} = X^i Z^j$, $(i,j) \in \mathcal{X}$, as a pair of elements from $\mathsf{F} = \mathsf{F}_d = \mathbb{Z}/d\mathbb{Z}$, the finite field consisting of $d$ elements. Recall that we have put $N_x = N_{x_1} \otimes \cdots \otimes N_{x_n}$ for $x = (x_1, \ldots, x_n) \in (\mathsf{F}^2)^n$. We write $N_J$ for $\{N_x \mid x \in J\}$ where $J \subseteq (\mathsf{F}^2)^n$. The index $((u_1, v_1), \ldots, (u_n, v_n)) \in (\mathsf{F}^2)^n$ of a basis element can be regarded as the plain $2n$-dimensional vector

$$ x = (u_1, v_1, \ldots, u_n, v_n) \in \mathsf{F}^{2n}. $$

We can equip the vector space $\mathsf{F}^{2n}$ over $\mathsf{F}$ with the standard symplectic bilinear form (symplectic pairing), which is defined by

$$ (x, y)_{\rm sp} = \sum_{i=1}^{n} u_i v'_i - v_i u'_i \tag{5} $$

for the above $x$ and $y = (u'_1, v'_1, \ldots, u'_n, v'_n) \in \mathsf{F}^{2n}$. For a subspace $L \subseteq \mathsf{F}^{2n}$, let $L^\perp$ be defined by

$$ L^\perp = \{ y \in \mathsf{F}^{2n} \mid \forall x \in L,\ (x, y)_{\rm sp} = 0 \}. $$

A subspace $L \subseteq \mathsf{F}^{2n}$ is said to be self-orthogonal (with respect to the symplectic bilinear form) if $L \subseteq L^\perp$.
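The symplectic form and the notion of self-orthogonality can be made concrete for small cases by exhaustive search. In the sketch below, vectors are tuples $(u_1, v_1, \ldots, u_n, v_n)$ over $\{0, \ldots, d-1\}$; the function names and the example generators (the index vectors of $Z \otimes Z \otimes I$ and $I \otimes Z \otimes Z$ for $d = 2$) are illustrative choices, not taken from the paper.

```python
import itertools

def sp(x, y, d):
    """Symplectic form sum_i (u_i v'_i - v_i u'_i) mod d, coordinates interleaved."""
    total = 0
    for i in range(0, len(x), 2):
        total += x[i] * y[i + 1] - x[i + 1] * y[i]
    return total % d

def span(gens, d):
    """All F_d-linear combinations of the generators (brute force, small cases only)."""
    length = len(gens[0])
    vecs = set()
    for coeffs in itertools.product(range(d), repeat=len(gens)):
        vecs.add(tuple(sum(c * g[i] for c, g in zip(coeffs, gens)) % d
                       for i in range(length)))
    return vecs

def is_self_orthogonal(gens, d):
    """span(gens) lies in its own perp iff the generators are pairwise orthogonal."""
    return all(sp(g, h, d) == 0 for g in gens for h in gens)

def perp(gens, d):
    """L-perp = {y : (x, y)_sp = 0 for all x in L}, by exhaustive search."""
    length = len(gens[0])
    return {y for y in itertools.product(range(d), repeat=length)
            if all(sp(g, y, d) == 0 for g in gens)}
```

For the two example generators with $n = 3$, $d = 2$, the span has $2^{2}$ elements and its orthogonal space has $2^{4}$ elements, in line with $\dim L = n - k$ and $\dim L^\perp = n + k$ for $k = 1$.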

The relation (4) implies the following two important properties of $N_n$:

$$ N_x N_y = \zeta_{xy} N_{x+y} \tag{6} $$

for some constants $\zeta_{xy}$ with $|\zeta_{xy}| = 1$, $x, y \in \mathsf{F}^{2n}$, and

$$ N_x N_y = \omega^{(x,y)_{\rm sp}} N_y N_x. \tag{7} $$

The latter implies that $(x, y)_{\rm sp} = 0$ if and only if $N_x$ and $N_y$ commute.
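Relation (7) can be verified exhaustively for small $d$ and $n$. The sketch below assumes $\omega = e^{2\pi i/d}$ and the interleaved indexing $x = (u_1, v_1, \ldots, u_n, v_n)$; the function names are hypothetical.

```python
import itertools
import numpy as np

def weyl_tensor(x, d):
    """N_x = X^{u_1} Z^{v_1} (x) ... (x) X^{u_n} Z^{v_n} for x = (u_1, v_1, ..., u_n, v_n)."""
    omega = np.exp(2j * np.pi / d)                      # assumed primitive root
    X = np.roll(np.eye(d, dtype=complex), -1, axis=0)   # X|j> = |(j-1) mod d>
    Z = np.diag(omega ** np.arange(d))                  # Z|j> = omega^j |j>
    out = np.eye(1, dtype=complex)
    for i in range(0, len(x), 2):
        N = np.linalg.matrix_power(X, x[i]) @ np.linalg.matrix_power(Z, x[i + 1])
        out = np.kron(out, N)
    return out

def sp(x, y, d):
    """Symplectic form (5), reduced mod d."""
    return sum(x[i] * y[i + 1] - x[i + 1] * y[i] for i in range(0, len(x), 2)) % d

def check_commutation(d, n):
    """True if N_x N_y = omega^{(x,y)_sp} N_y N_x for every pair x, y in F_d^{2n}."""
    omega = np.exp(2j * np.pi / d)
    vectors = list(itertools.product(range(d), repeat=2 * n))
    for x in vectors:
        Nx = weyl_tensor(x, d)
        for y in vectors:
            Ny = weyl_tensor(y, d)
            if not np.allclose(Nx @ Ny, omega ** sp(x, y, d) * (Ny @ Nx)):
                return False
    return True
```

The check is exponential in $n$ and is meant only as a sanity test of the sign convention in (5) against the operators in (3).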

The statement of the following lemma can be found in Gottesman's dissertation (Section 3.2). A simple constructive proof based on the very basics of symplectic geometry is given in an appendix.

Lemma 1: Let $L$ be a self-orthogonal subspace with $\dim L = n - k$ and $L = {\rm span}\,\{g_1, \ldots, g_{n-k}\}$. Then, we can find vectors $g_{n-k+1}, \ldots, g_n$ and $h_1, \ldots, h_n$ such that

$$ (g_i, g_j)_{\rm sp} = 0, \quad (h_i, h_j)_{\rm sp} = 0, \quad (g_i, h_j)_{\rm sp} = \delta_{ij} \tag{8} $$

for $i, j = 1, \ldots, n$, where $\delta_{ij}$ is the Kronecker delta. ◊

Remark. In Gottesman's dissertation, $N_{g_i}$ and $N_{h_i}$, $i = n-k+1, \ldots, n$ (see Section III-E), appear as $\overline{Z}_i$ and $\overline{X}_i$, respectively, with examples of them for a number of symplectic stabilizer codes. □

A pair of linearly independent vectors $(g, h)$ with $(g, h)_{\rm sp} = 1$ is called a hyperbolic pair, and it is known that a space with a nondegenerate symplectic form, such as the one defined by (5), can be decomposed into an orthogonal sum of the form

$$ {\rm span}\,\{w_1, z_1\} \perp \cdots \perp {\rm span}\,\{w_n, z_n\} $$

in such a way that $(w_i, z_i)$, $i = 1, \ldots, n$, are hyperbolic pairs [33]. Following Artin [33], we have referred and will refer to the direct sum of $U_1, \ldots, U_n$ as the orthogonal sum of spaces $U_1, \ldots, U_n$ if $U_1, \ldots, U_n$ are orthogonal. The three equations in the above lemma say that $\mathsf{F}^{2n}$ is the orthogonal sum of ${\rm span}\,\{g_i, h_i\}$, $i = 1, \ldots, n$. In the present case with the bilinear form in (5), the simplest example of such a decomposition of the space $\mathsf{F}^{2n}$ is ${\rm span}\,\{e_1, e_2\} \perp \cdots \perp {\rm span}\,\{e_{2n-1}, e_{2n}\}$, where $\{e_i\}_{1 \le i \le 2n}$ is the standard basis of $\mathsf{F}^{2n}$ that consists of $e_i = (\delta_{ij})_{1 \le j \le 2n} \in \mathsf{F}^{2n}$, $1 \le i \le 2n$.
For the remainder of this section, we fix an arbitrary self-orthogonal subspace $L$ with $\dim L = n - k$ as in Lemma 1 and such hyperbolic pairs $(g_1, h_1), \ldots, (g_n, h_n)$ as just constructed. We define a linear map $\gamma : \mathsf{F}^{2n} \to \mathsf{F}^{2n}$, which depends on the basis $\{g_1, h_1, \ldots, g_n, h_n\}$, by

$$ \gamma(x) = (w_1, z_1, \ldots, w_n, z_n) \tag{9} $$

for a vector $x \in \mathsf{F}^{2n}$ expanded into

$$ x = \sum_{i=1}^{n} (w_i g_i + z_i h_i). \tag{10} $$

The $j$-th coordinate of $\gamma(x)$ is denoted by $\gamma_j(x)$. In other words, we define $\gamma_j$ by

$$ \gamma_{2k-1}\Big(\sum_{i=1}^{n} (x'_{2i-1} g_i + x'_{2i} h_i)\Big) = x'_{2k-1}, \quad \gamma_{2k}\Big(\sum_{i=1}^{n} (x'_{2i-1} g_i + x'_{2i} h_i)\Big) = x'_{2k} \tag{11} $$

for $k = 1, \ldots, n$. For $z = (z_1, \ldots, z_m) \in \mathsf{F}^m$, $1 \le m \le n$, we write

$$ N^{(z)} = \prod_{i=1}^{m} (N_{h_i})^{z_i} \tag{12} $$

where the product is unambiguous because $(N_{h_i})^{z_i}$, $i = 1, \ldots, m$, commute with each other. Note that by (6), $N^{(z)}$ and $N_x$, where $x = \sum_{i=1}^{m} z_i h_i$, are the same up to a phase factor. Similarly, for $w = (w_1, \ldots, w_m) \in \mathsf{F}^m$, $1 \le m \le n$, we write

$$ N^{[w]} = \prod_{i=1}^{m} (N_{g_i})^{w_i}. \tag{13} $$

We have seen that any basis $\{g_1, \ldots, g_{n-k}\}$ of a self-orthogonal space can be extended to $\{g_1, \ldots, g_n\}$ in such a way that ${\rm span}\,\{g_1, \ldots, g_n\}$ is self-orthogonal. Since $N_{g_i}$, $i = 1, \ldots, n$, commute with each other by (7), we can find a basis of $\mathsf{H}^{\otimes n}$ on which the $N_{g_i}$ are simultaneously diagonalized in matrix forms (e.g., [10]). Hence, we can find an $n$-tuple of scalars $(\mu_i)_{1 \le i \le n}$ for which the space consisting of $\psi$ with

$$ N_{g_i} \psi = \mu_i \psi, \quad i = 1, \ldots, n, \tag{14} $$

is nonzero. We call a nonzero vector (respectively, the set of vectors) satisfying (14) an eigenvector (respectively, the eigenspace) of $\{N_{g_i}\}_{1 \le i \le n}$ with eigenvalue list $(\mu_i)_{1 \le i \le n}$. Take a normalized vector $|0, \ldots, 0\rangle$ from this eigenspace, where the label $(0, \ldots, 0)$ belongs to $\mathsf{F}^n$. Applying an operator $N_x$ to both sides of (14) from the left and using (7) as well as the symplectic property

$$ (x, y)_{\rm sp} = -(y, x)_{\rm sp}, $$

we have

$$ N_x N_{g_i} \psi = \mu_i N_x \psi \iff N_{g_i} N_x \psi = \mu_i \omega^{(g_i, x)_{\rm sp}} N_x \psi. \tag{15} $$

This means that $N_x \psi$ is an eigenvector with eigenvalue list $(\mu_i \omega^{(g_i, x)_{\rm sp}})_{1 \le i \le n}$. If we expand $x$ as in (10), then we have $(g_i, x)_{\rm sp} = z_i$, $i = 1, \ldots, n$, and hence there are at least $d^n$ possible eigenvalue lists for $\{N_{g_i}\}_{1 \le i \le n}$. However, for any pair of distinct eigenvalue lists, the corresponding eigenspaces of $\{N_{g_i}\}_{1 \le i \le n}$ are orthogonal, and hence there are no more eigenvalue lists. Thus, we have an orthonormal basis $\{|s_1, \ldots, s_n\rangle\}_{(s_1, \ldots, s_n) \in \mathsf{F}^n}$ defined by

$$ |s_1, \ldots, s_n\rangle = N^{(s)} |0, \ldots, 0\rangle, \quad \text{where } s = (s_1, \ldots, s_n). \tag{16} $$

Note that the basis $\{|s_1, \ldots, s_n\rangle\}_{(s_1, \ldots, s_n) \in \mathsf{F}^n}$ depends on $(g_i, h_i)$, $i = 1, \ldots, n$, as well as $(\mu_i)_{1 \le i \le n}$.
Now we are ready to see the principle of symplectic codes.

Lemma 2: Let a subspace $L \subseteq \mathsf{F}^{2n}$ satisfy

$$ L \subseteq L^\perp \quad \text{and} \quad \dim L = n - k. \tag{17} $$

In addition, let $J_0 \subseteq \mathsf{F}^{2n}$ be a set satisfying

$$ \forall x, y \in J_0, \ [\, y - x \in L^\perp \Rightarrow x = y \,], \tag{18} $$

and put

$$ J = J_0 + L = \{ z + w \mid z \in J_0, w \in L \}. $$

Then, the $d^k$-dimensional subspaces of the form

$$ \{ \psi \in \mathsf{H}^{\otimes n} \mid \forall M \in N_L, \ M\psi = \tau(M)\psi \}, \tag{19} $$

where $\tau(M)$ are eigenvalues of $M \in N_L$, are $N_J$-correcting codes. ◊

In fact, the subspace

$$ \mathcal{C}^{(s)} = {\rm span}\,\{ |s_1, \ldots, s_{n-k}, s_{n-k+1}, \ldots, s_n\rangle \mid (s_{n-k+1}, \ldots, s_n) \in \mathsf{F}^k \} \tag{20} $$

with a fixed $(n-k)$-tuple $s = (s_1, \ldots, s_{n-k}) \in \mathsf{F}^{n-k}$ is such a quantum code. The equivalence of (19) and (20) can be verified from the relations above. Since there are $d^{n-k}$ possible choices for $(s_1, \ldots, s_{n-k})$, we have $d^{n-k}$ codes. The term codes is applied to both a self-orthogonal subspace $L \subseteq \mathsf{F}^{2n}$ and the quantum codes $\mathcal{C}^{(s)}$ associated with $L$, which we will call symplectic (stabilizer) codes with stabilizer $N_L$. Since $L^\perp$ is spanned by $g_1, \ldots, g_n$ and $h_{n-k+1}, \ldots, h_n$, any coset of $L^\perp$ in $\mathsf{F}^{2n}$ is of the form

$$ \Big\{ \sum_{i=1}^{n} (w_i g_i + z_i h_i) \ \Big|\ z_i = s_i, \ i = 1, \ldots, n-k \Big\} = \{ x \mid (g_i, x)_{\rm sp} = s_i, \ i = 1, \ldots, n-k \} \tag{21} $$

with some $(n-k)$-tuple $s = (s_1, \ldots, s_{n-k})$. In terms of $\gamma_j$ defined by (11), the coset in (21) can be rewritten as

$$ \{ x \mid \gamma_{2i}(x) = s_i, \ i = 1, \ldots, n-k \}. \tag{22} $$

The set of cosets of $L^\perp$ and $\{N_x \mathcal{C}^{(0)} \mid x \in J_0\}$, where $N_x \mathcal{C}^{(0)}$ denotes $\{N_x \psi \mid \psi \in \mathcal{C}^{(0)}\}$ with $0 = (0, \ldots, 0) \in \mathsf{F}^{n-k}$, are in a one-to-one correspondence when $J_0$ is a transversal (a complete set of coset representatives), i.e., when $|J_0| = d^{n-k}$. In fact, for any vector $x$ in the coset in (21) or (22), we have (cf. Section III-B below)

$$ \mathcal{C}^{(s)} = N_x \mathcal{C}^{(0)}. \tag{23} $$

The $(n-k)$-tuple $(s_i)_{1 \le i \le n-k}$ is called a syndrome, by analogy with classical linear codes.
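The syndrome in (21) is simply the list of symplectic products of the error index with the generators of $L$. The sketch below illustrates this, together with a check of condition (18), which holds exactly when distinct elements of $J_0$ have distinct syndromes. The function names and the $d = 2$, $n = 3$ example in the usage note are illustrative assumptions.

```python
def sp(x, y, d):
    """Symplectic form (5) for x = (u_1, v_1, ..., u_n, v_n), reduced mod d."""
    return sum(x[i] * y[i + 1] - x[i + 1] * y[i] for i in range(0, len(x), 2)) % d

def syndrome(gens, x, d):
    """Syndrome s_i = (g_i, x)_sp, i = 1, ..., n-k, of the error index x."""
    return tuple(sp(g, x, d) for g in gens)

def satisfies_condition_18(J0, gens, d):
    """Condition (18): y - x in L-perp implies x = y for all x, y in J0.

    Since y - x lies in L-perp exactly when x and y have equal syndromes,
    this amounts to the syndromes of the elements of J0 being distinct.
    """
    seen = set()
    for x in set(J0):
        s = syndrome(gens, x, d)
        if s in seen:
            return False
        seen.add(s)
    return True
```

For example, with generators `(0,1,0,1,0,0)` and `(0,0,0,1,0,1)` (the indices of $Z \otimes Z \otimes I$ and $I \otimes Z \otimes Z$), the zero vector and the three single-position $X$ errors have four distinct syndromes and so form a transversal $J_0$ of the cosets of $L^\perp$.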

To show that the subspace, say $\mathcal{C}$, in (19) or (20) is really $N_J$-correcting, we may use Theorem III.2 of Knill and Laflamme. Alternatively, we can directly check the error-correcting capability using the recovery operator $\mathcal{R}$ defined by

$$ \mathcal{R} \sim \{\Sigma_{\rm res}\} \cup \{N_x^\dagger \Sigma_x\}_{x \in J_0}, \tag{24} $$

where $\Sigma_x$ is the projection onto $N_x \mathcal{C} = \{N_x \psi \mid \psi \in \mathcal{C}\}$, and $\Sigma_{\rm res}$ is the projection onto the orthogonal complement of $\bigoplus_{x \in J_0} N_x \mathcal{C}$.

To be specific about which class of codes we are treating, we define the next. 

Definition 2: We define $S_n(N)$, $n \ge 1$, to be the set of all symplectic stabilizer codes with stabilizer $N_L$, i.e., all $d^k$-dimensional subspaces of $\mathsf{H}^{\otimes n}$ of the form (19) or (20), with some subspace $L \subseteq \mathsf{F}^{2n}$ satisfying (17) for some $k$, $0 \le k \le n$. ◊

B. Tracing Errors 

Viewing index vectors in terms of the basis $\{g_1, h_1, \ldots, g_n, h_n\}$ is also useful to trace the action of an error in $N_n$ on a state in the code space. The view introduced in this subsection, as well as that in the next one, will underlie the proof of the main result to be given later. Let us consider the code $\mathcal{C}^{(s)}$ in (20) assuming $s_1 = \cdots = s_{n-k} = 0$, which loses no generality since $(\mu_i)_{1 \le i \le n}$ is arbitrary.

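Suppose an error $N_x$ has occurred on a state $\rho$ whose range (image, or support) is contained in the code space $\mathcal{C}^{(0)}$. We expand $x$ as in (10) and put

$$ \begin{aligned} t &= (w_1, \ldots, w_{n-k}), \\ s &= (z_1, \ldots, z_{n-k}), \\ u_i &= z_{i+n-k}, \quad i = 1, \ldots, k, \\ u'_i &= w_{i+n-k}, \quad i = 1, \ldots, k, \\ u &= (u_1, \ldots, u_k), \\ u' &= (u'_1, \ldots, u'_k). \end{aligned} \tag{25} $$

Then, for the purpose of analysis, we interpret the action of $N_x$ as follows: First, $N^{[t]}$ occurred to make no change on $\rho$; second, $N^{(s)}$ occurred to change $\rho$, which is a linear combination of

$$ |0, \ldots, 0, b_1, \ldots, b_k\rangle \langle 0, \ldots, 0, b'_1, \ldots, b'_k|, \quad (b_1, \ldots, b_k), (b'_1, \ldots, b'_k) \in \mathsf{F}^k, $$

into the linear combination $\rho'$ of

$$ |z_1, \ldots, z_{n-k}, b_1, \ldots, b_k\rangle \langle z_1, \ldots, z_{n-k}, b'_1, \ldots, b'_k|, \quad (b_1, \ldots, b_k), (b'_1, \ldots, b'_k) \in \mathsf{F}^k, $$

with the same coefficients by (16); and finally, $X_u Z_{u'}$ occurred to act on $\rho'$ as $X_u Z_{u'} \rho' Z_{u'}^\dagger X_u^\dagger$, where the actions of $X_u$ and $Z_{u'}$, $u, u' \in \mathsf{F}^k$, are defined by

$$ X_u |z_1, \ldots, z_{n-k}, b_1, \ldots, b_k\rangle = |z_1, \ldots, z_{n-k}, b_1 + u_1, \ldots, b_k + u_k\rangle $$

and

$$ Z_{u'} |z_1, \ldots, z_{n-k}, b_1, \ldots, b_k\rangle = \prod_{i=1}^{k} \omega^{u'_i b_i} |z_1, \ldots, z_{n-k}, b_1, \ldots, b_k\rangle $$

for $(z_1, \ldots, z_{n-k}) \in \mathsf{F}^{n-k}$, $(b_1, \ldots, b_k) \in \mathsf{F}^k$. In other words, we have the next.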
Lemma 3: Let $L$ and $\{g_1, h_1, \ldots, g_n, h_n\}$ be given as above. Then, we have

$$ N_x \rho N_x^\dagger = X_u Z_{u'} N^{(s)} N^{[t]} \rho N^{[t]\dagger} N^{(s)\dagger} Z_{u'}^\dagger X_u^\dagger = X_u Z_{u'} N^{(s)} \rho N^{(s)\dagger} Z_{u'}^\dagger X_u^\dagger \tag{26} $$

for any operator $\rho$ such that the ranges of $\rho$ and $\rho^\dagger$ are contained in the code space in (20) and for any $x \in \mathsf{F}^{2n}$, where $t, s, u, u'$ are determined from $x$ through (25).

This is clear from the decomposition

$$ x = \sum_{i=1}^{n-k} w_i g_i + \sum_{i=1}^{n-k} z_i h_i + \sum_{i=1}^{k} u_i h_{i+n-k} + \sum_{i=1}^{k} u'_i g_{i+n-k} $$

for $\rho = \Pi_{\mathcal{C}^{(0)}}$. For a general operator $\rho$, we should consider phase factors as is done in an appendix. Observe that the action of $X_u Z_{u'}$ is similar to that of $N_{(u_1, u'_1, \ldots, u_k, u'_k)} = X^{u_1} Z^{u'_1} \otimes \cdots \otimes X^{u_k} Z^{u'_k} \in N_k$ on the standard basis states of $\mathsf{H}^{\otimes k}$.

C. Coset Arrays and Probability Arrays 

To understand the action of errors in $N_n$ on symplectic codes associated with the self-orthogonal subspace $L \subseteq \mathsf{F}^{2n}$, it is helpful to consider cosets of $L$. Since $\dim L = n - k$ implies $\dim L^\perp = n + k$ (Lemma 1), we have $d^{n-k}$ cosets of $L^\perp$ in $\mathsf{F}^{2n}$, and each coset is a union of $d^{2k}$ cosets of $L$ in $\mathsf{F}^{2n}$. To grasp the situation, we write down an array of cosets, which we will call a coset array of $L$, as follows:

$$ \begin{array}{cccc} y_0 + x_0 + L & y_0 + x_1 + L & \cdots & y_0 + x_{K-1} + L \\ y_1 + x_0 + L & y_1 + x_1 + L & \cdots & y_1 + x_{K-1} + L \\ \vdots & \vdots & & \vdots \\ y_{M-1} + x_0 + L & y_{M-1} + x_1 + L & \cdots & y_{M-1} + x_{K-1} + L \end{array} \tag{27} $$

where $K = d^{2k}$, $M = d^{n-k}$, $\{x_i\}$ is a transversal of the cosets of $L$ in $L^\perp$, and $\{y_i\}$ is that of the cosets of $L^\perp$ in $\mathsf{F}^{2n}$. In the array, each entry is a coset of $L$ in $\mathsf{F}^{2n}$, and each row forms a coset of $L^\perp$ in $\mathsf{F}^{2n}$. This array, which has appeared in Fig. 1 of DiVincenzo et al. [13] in a different configuration, resembles standard arrays often used in classical coding theory, though they differ in that elements of standard arrays are vectors rather than cosets.
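For small parameters the coset array (27) can be generated by brute force. The sketch below groups the cosets of $L$ into rows, using the fact that two vectors lie in the same coset of $L^\perp$ exactly when their symplectic products with the generators of $L$ agree. It assumes linearly independent generators, and the function names are hypothetical.

```python
import itertools

def sp(x, y, d):
    """Symplectic form (5) for x = (u_1, v_1, ..., u_n, v_n), reduced mod d."""
    return sum(x[i] * y[i + 1] - x[i + 1] * y[i] for i in range(0, len(x), 2)) % d

def span(gens, d):
    """All F_d-linear combinations of the generators (brute force)."""
    length = len(gens[0])
    return {tuple(sum(c * g[i] for c, g in zip(coeffs, gens)) % d for i in range(length))
            for coeffs in itertools.product(range(d), repeat=len(gens))}

def coset_array(gens, d):
    """Map each row label (a syndrome) to the set of cosets of L inside that coset of L-perp."""
    length = len(gens[0])
    L = span(gens, d)
    rows = {}
    for x in itertools.product(range(d), repeat=length):
        coset = frozenset(tuple((a + b) % d for a, b in zip(x, l)) for l in L)
        row_label = tuple(sp(g, x, d) for g in gens)   # constant on cosets of L-perp
        rows.setdefault(row_label, set()).add(coset)
    return rows
```

With the generators `(0,1,0,1,0,0)` and `(0,0,0,1,0,1)` and $d = 2$ (so $n = 3$, $k = 1$), the result has $M = d^{n-k} = 4$ rows, each containing $K = d^{2k} = 4$ cosets of $L$.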

We have already seen that cosets of $L^\perp$ can be labeled with $s \in \mathsf{F}^{n-k}$ as in (21) or (22). Furthermore, using hyperbolic pairs $(g_i, h_i)$, $i = 1, \ldots, n$, as in Lemma 1, we can label cosets of $L$ in $L^\perp$ by $(u_1, u'_1, \ldots, u_k, u'_k) \in \mathsf{F}^{2k}$. In fact, since $(g_i, h_i)$, $i = n-k+1, \ldots, n$, together with the basis elements $g_i$, $i = 1, \ldots, n-k$, of $L$, form a basis of $L^\perp$, each coset of $L$ in $L^\perp$ can be written in the form

$$ \{ x \in L^\perp \mid \gamma_{2i-1}(x) = u_{i-n+k}, \ \gamma_{2i}(x) = u'_{i-n+k} \ \text{for } n-k+1 \le i \le n \} $$

with some $(u_1, u'_1, \ldots, u_k, u'_k) \in \mathsf{F}^{2k}$. As a result, each coset of $L$ in $\mathsf{F}^{2n}$ can be specified by some $(s_1, \ldots, s_{n-k}) \in \mathsf{F}^{n-k}$ and $(u_1, u'_1, \ldots, u_k, u'_k) \in \mathsf{F}^{2k}$ as the set that consists of the vectors $x \in \mathsf{F}^{2n}$ satisfying

$$ \gamma_{2i}(x) = s_i \ \text{for } 1 \le i \le n-k \quad \text{and} \quad \gamma_{2i-1}(x) = u_{i-n+k}, \ \gamma_{2i}(x) = u'_{i-n+k} \ \text{for } n-k+1 \le i \le n. $$

Keeping this labeling in mind, we will introduce other important quantities, in terms of which our bound on the capacity will be described. Given a channel $\mathcal{A} \sim \{\sqrt{P(u)}\, N_u\}_{u \in \mathcal{X}}$, we define a probability distribution $P_L$ by

$$ P_L((s, u)) = \sum_{x} P^n(x), \tag{28} $$

where the sum is over all $x$ with $\gamma_{2i}(x) = s_i$ for $1 \le i \le n-k$ and $\gamma_{2i-1}(x) = u_{i-n+k}$, $\gamma_{2i}(x) = u'_{i-n+k}$ for $n-k+1 \le i \le n$, and where $s = (s_1, \ldots, s_{n-k}) \in \mathsf{F}^{n-k}$, $u = (u_1, u'_1, \ldots, u_k, u'_k) \in \mathsf{F}^{2k}$. Now, arrange $P_L(s, u) = P_L((s, u))$ into the array of probabilities

$$ \begin{array}{cccc} P_L(0_{n-k}, 0_{2k}) & P_L(0_{n-k}, 0 \ldots 01) & \cdots & P_L(0_{n-k}, 11 \ldots 1) \\ P_L(0 \ldots 01, 0_{2k}) & P_L(0 \ldots 01, 0 \ldots 01) & \cdots & P_L(0 \ldots 01, 11 \ldots 1) \\ \vdots & \vdots & & \vdots \\ P_L(11 \ldots 1, 0_{2k}) & P_L(11 \ldots 1, 0 \ldots 01) & \cdots & P_L(11 \ldots 1, 11 \ldots 1) \end{array} \tag{29} $$

where $0_m$ denotes the zero vector in $\mathsf{F}^m$ and an $m$-tuple $(b_1, \ldots, b_m) \in \mathsf{F}^m$ is simply written as $b_1 \ldots b_m$. We have assumed here $d = 2$ in order that the array may not look complicated, the general description being obvious. We will call this a probability array of $L$. Each probability $P_L(s, u)$ is the probability of the corresponding entry in (27) if the coset representatives $x_i$ and $y_j$ are chosen accordingly. Note that the index $s$ in Section III-B corresponds to the row index $s$ in the probability array, and $u_i, u'_i$ are used for the column index. We remark that $P_L$ depends on the choice of hyperbolic pairs $(g_i, h_i)$, $i = 1, \ldots, n$, but the probability array of $L$ is unique up to permutations of rows and columns.

D. Decoding Symplectic Stabilizer Codes 

Coset arrays are useful to understand the decoding principle of symplectic stabilizer codes. Suppose we are given a code with stabilizer $N_L$. As explained in Section III-A, once we specify $J_0$, (a subset of) a transversal of the cosets of $L^\perp$ in $\mathsf{F}^{2n}$, the recovery operator of symplectic codes is determined from $J_0$ as in (24) in such a way that the code can correct errors in $N_J$, where $J = J_0 + L$. The set $J = J_0 + L$ is a union of some cosets of $L$, and in view of (27), each row of the coset array has exactly one coset (or none) which is a constituent of $J$. Thus, the design of a decoder of symplectic codes with stabilizer $N_L$ is accomplished by choosing a coset from each row of the array. When the code in Lemma 2 is used on a memoryless channel $\mathcal{A} \sim \{\sqrt{P(u)}\, N_u\}_{u \in \mathcal{X}}$, a natural choice for such a coset in each row may be one that has the largest value of $P_L$ in the row, since it is analogous to maximum likelihood decoding, which is an optimum strategy for classical coding. Our codes to be proven to have the desired performance are concatenated codes, and our choice for $J_0$ will turn out to be more technical, exploiting the structure of concatenated codes.
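The "largest entry in each row" rule described above is easy to state in code. The sketch below takes a probability array as a nested dictionary `prob_array[s][u]` standing for $P_L(s, u)$ (a hypothetical data layout) and returns the chosen column per row together with the total probability $P^n(J)$ of the resulting set $J$. It illustrates only this natural choice, not the more technical choice of $J_0$ used later for concatenated codes.

```python
def choose_cosets(prob_array):
    """Pick, in each row s, a column u with the largest P_L(s, u).

    Returns (choice, covered), where choice maps each row label to the chosen
    column label and covered is the total probability of the chosen cosets.
    Ties are broken by dictionary order, which is an arbitrary convention.
    """
    choice = {s: max(row, key=row.get) for s, row in prob_array.items()}
    covered = sum(prob_array[s][u] for s, u in choice.items())
    return choice, covered
```

For a toy array with made-up numbers such as `{"s0": {"a": 0.5, "b": 0.1}, "s1": {"a": 0.1, "b": 0.3}}`, the rule selects column `"a"` in row `"s0"` and column `"b"` in row `"s1"`.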

E. Remarks on Symplectic Stabilizer Codes 

When $d = 2$, often used is the slightly different basis $\{N'_{(i,j)}\}_{(i,j) \in \mathcal{X}}$, the elements of which are defined by $N'_{(i,j)} = X^i Z^j = N_{(i,j)}$ for $(i,j) = (0,0), (0,1), (1,0) \in \mathcal{X}$ and $N'_{(1,1)} = \sqrt{-1}\, XZ = \sqrt{-1}\, N_{(1,1)}$. The arguments below all work if $N'$ is used instead of $N$ for $d = 2$.

As already mentioned, the recovery operator for an $N_J$-correcting code in Lemma 2 is given by $\mathcal{R} \sim \{N_x^\dagger \Sigma_x\}_{x \in J_0}$, where we assume $|J_0| = d^{n-k}$ for simplicity. A physical meaning of this recovery process is simple: It can be described as the orthogonal measurement $\{\Sigma_x\}_{x \in J_0}$ followed by the unitary operation $N_x^\dagger$, which is chosen according to the measurement result $x$. The measurement $\{\Sigma_x\}_{x \in J_0}$ is realized by the observables $N'_{g_i}$, $i = 1, \ldots, n-k$, when $d = 2$ (and similar self-adjoint operators that correspond to $N_{g_i}$, $i = 1, \ldots, n-k$, for $d > 2$). In this case, the syndrome is obtained as a measurement result.



IV. Concatenated Codes 

The very first quantum code discovered by Shor [1] is an example of a concatenated code. The idea of the following general code construction can be found in Gottesman's dissertation (Section 3.5). Let $L \subseteq \mathsf{F}^{2n}$ and $L_{\rm out} \subseteq \mathsf{F}^{2k\nu}$ be self-orthogonal codes with $\dim L = n - k$ and $\dim L_{\rm out} = k\nu - \kappa$. Let $\{g_1, \ldots, g_{n-k}\}$ and $\{g'_1, \ldots, g'_{k\nu-\kappa}\}$ be bases of $L$ and $L_{\rm out}$, respectively. Let $\{g_1, \ldots, g_{n-k}\}$ be supplemented by $g_{n-k+1}, \ldots, g_n$ and $h_1, \ldots, h_n$ to form a basis with the property in Lemma 1. We will construct a new self-orthogonal code of length $2n\nu$ from $L$ and $L_{\rm out}$.

For any vector $x = (x_1, \ldots, x_{2n}) \in \mathsf{F}^{2n}$, let $x^{(j)}$ denote the vector $(0, \ldots, 0, x, 0, \ldots, 0) \in \mathsf{F}^{2n\nu}$, where we have divided the $2n\nu$ coordinates into $\nu$ blocks of length $2n$ and $x$ appears at the $j$-th block. Next, for any $x = (u_{1,1}, u'_{1,1}, \ldots, u_{1,k}, u'_{1,k}, \ldots, u_{\nu,1}, u'_{\nu,1}, \ldots, u_{\nu,k}, u'_{\nu,k}) \in \mathsf{F}^{2k\nu}$, let us denote by $\tilde{x} \in \mathsf{F}^{2n\nu}$ the vector specified by

$$ \tilde{x} = \sum_{j=1}^{\nu} \sum_{m=1}^{k} u_{j,m}\, g^{(j)}_{n-k+m} + u'_{j,m}\, h^{(j)}_{n-k+m}. \tag{30} $$

Especially, we apply this map to $g'_i$ to obtain $\tilde{g}'_i$, $i = 1, \ldots, k\nu - \kappa$. Note that the map that sends $x$ to $\tilde{x}$ preserves the symplectic inner product. This is because mutually orthogonal hyperbolic pairs $(e_{2i-1}, e_{2i})$, $i = 1, \ldots, k\nu$, are mapped to mutually orthogonal hyperbolic pairs $(g^{(j)}_{n-k+m}, h^{(j)}_{n-k+m})$, $j = 1, \ldots, \nu$, $m = 1, \ldots, k$, where $\{e_i\}_{1 \le i \le 2k\nu}$ is the standard basis of $\mathsf{F}^{2k\nu}$. Clearly,

$$ G = \{ g_i^{(j)} \mid i = 1, \ldots, n-k; \ j = 1, \ldots, \nu \} $$

is a set of mutually orthogonal independent vectors. Since $\tilde{g}'_i$, $i = 1, \ldots, k\nu - \kappa$, are spanned by $g^{(j)}_{n-k+m}$ and $h^{(j)}_{n-k+m}$, $j = 1, \ldots, \nu$, $m = 1, \ldots, k$, which are orthogonal to each element of $G$, we see that $G \cup \{\tilde{g}'_i \mid i = 1, \ldots, k\nu - \kappa\} \subseteq \mathsf{F}^{2n\nu}$ is a basis of a self-orthogonal code of dimension $(n-k)\nu + k\nu - \kappa = n\nu - \kappa$. The code over $\mathsf{F}$ obtained by concatenating two codes $L$ and $L_{\rm out}$ in this way will be denoted by ${\rm cat}(L, L_{\rm out})$. Symplectic quantum codes associated with ${\rm cat}(L, L_{\rm out}) \subseteq \mathsf{F}^{2n\nu}$ of the above parameters have information rate $\kappa/(n\nu)$.

Examples of codes with inner codes having parameter $k = 1$ can be found in the literature. For instance, a code with $n = \nu = 5$ and $L = L_{\rm out}$ was given by Gottesman (Section 3.5, Table 3.7) with a table of $\{N_x \mid x \in {\rm cat}(L, L_{\rm out})\}$.

V. Proof of Theorem 1 and Remarks

A. Proof of Theorem 1

The theorem can be obtained as a consequence of the next stronger statement, which is proved in Appendix C.

Theorem 2: Let a function $E_{n,k}$ be defined by

$$E_{n,k}(R, P_L) = \min_{P'}\big[D(P'\|P_L) + |k - kR - H(P'|\overline{P'})|^+\big] \tag{31}$$

where $|a|^+ = \max\{a, 0\}$ and the minimum with respect to $P'$ is taken over all probability distributions on $F^{n-k} \times F^{2k}$. Let $R'$ satisfy $0 \le R' \le 1$. Then, for a memoryless channel $\mathcal{A} \sim \{\sqrt{P(u)}N_u\}_{u\in\mathcal{X}}$ and any self-orthogonal code $L \subseteq F^{2n}$ with dimension $n-k$, $1 \le k \le n$, we have

$$\limsup_{m\to\infty} -\frac{\log_d\big[1 - F^{\star}_{m,R'm}(\mathcal{A}^{\otimes m})\big]}{m} \ge \frac{E_{n,k}(R'n/k, P_L)}{n} \tag{32}$$

where $P_L$ is the probability distribution on $F^{n-k} \times F^{2k}$ defined by (28). ◇
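From the general property of the Kullback-Leibler information $D$ that $D(P\|Q) \ge 0$ with equality if and only if $P = Q$, it follows that $E_{n,k}(R, P_L)$ is positive if $kR < k - H(P_L|\overline{P_L})$. With $R = R'n/k$, this condition reads $R' < [k - H(P_L|\overline{P_L})]/n$. Hence, $[k - H(P_L|\overline{P_L})]/n$ is a lower bound on the capacity of the channel. Thus, the next corollary follows.

Corollary 1: For the memoryless channel $\mathcal{A}$, we have

$$\mathsf{C}(\mathcal{A}) \ge \sup_{n\ge 1}\max_{L}\frac{k - H(P_L|\overline{P_L})}{n},$$

where the maximum with respect to $L$ is over all $k$ with $1 \le k \le n$ and all $L \subseteq F^{2n}$ with $\dim L = n-k$ and $L \subseteq L^{\perp}$.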

Remark. When $n = k$, a coset array of $L$ consists of a single row, and $H(P_L|\overline{P_L})$ is to be understood as $H(P_L) = nH(P)$. In this case, $[k - H(P_L|\overline{P_L})]/n = 1 - H(P)$, which is the known lower bound. □

We also have the next. 

Lemma 4- Let A be the one in Theorem |T], viz., A ~ {\/ P{u)Nu}u£X) L be a self-orthogonal code 
satisfying (0), and Pl be defined by (P^). Then, we have 



$$k - H(P_L|\overline{P_L}) = I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big),$$
where $\mathcal{C}$ is a symplectic code with stabilizer $N_L$. ◇

A proof is given in Appendix D. Corollary 1 and Lemma 4 are extensions of the facts established by Shor and Smolin and by DiVincenzo and these authors [13], who restricted $L$ to quantum repetition codes having parameter $k = 1$.

Corollary 1, together with Lemma 4, establishes Theorem 1.

B. Remarks on Theorems 1 and 2

The quantity $H(P_L|\overline{P_L})$ appearing in Corollary 1 can be written solely in terms of $L$, which specifies the quantum code, and $P$, which specifies the channel. It does not depend on the choice of hyperbolic pairs $(g_i, h_i)$, $i = 1,\dots,n$, since $H(P_L|\overline{P_L})$ is a function of the probability array of $L$, as is mentioned in the proof of Lemma 6 (Appendix F), and its value does not change if we permute rows or columns of the array. Similarly, $E_{n,k}(R, P_L)$ does not depend on the choice of hyperbolic pairs $(g_i, h_i)$, $i = 1,\dots,n$.

C. Idea for Proof of Theorem 2

The theorem is proved with a random coding argument similar to those in [10], [11], the main difference being in the decoding strategy. A concatenated code associated with $\mathrm{cat}(L, L_{\rm out})$ is a symplectic stabilizer code, so that we can apply the decoding strategy described in Section III-D to it. In particular, the minimum entropy decoder employed there can be used. In the proof of Theorem 2, however, we modify this decoding strategy incorporating Shor and Smolin's idea. Namely, we choose a vector that minimizes the conditional entropy of its type in each coset of $\mathrm{cat}(L, L_{\rm out})^{\perp}$ in $F^{2n\nu}$, where the conditioning is on the result of measuring the observables $N_g$, $g \in G$, when $d = 2$, or similar ones for $d > 2$, which form a part of the syndrome of the concatenated code.

VI. Conditional Capacity
A. Conditional Capacity and Upper Bound

In discussing capacity problems on classical channels, we sometimes put restrictions on coding schemes. For example, there are works on the highest information rate achievable by linear codes [50], the capacity with cost or power constraints, and so on. In a similar way, we discuss a conditional quantum capacity in this section. Suppose for each $n > 0$, a set $T_n$ of subspaces of $\mathsf{H}^{\otimes n}$ is given. We imagine the situation in which only subspaces belonging to $T_n$ can be used as codes.

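Definition 3: Let a sequence of code classes $\{T_n\}$ be given, and let $F^{\star}_{n,k}(\mathcal{A}^{\otimes n}|T_n)$ denote the supremum of $F(\mathcal{C}, \mathcal{R}\mathcal{A}^{\otimes n})$ such that there exists a code $(\mathcal{C}, \mathcal{R})$ with $\mathcal{C} \in T_n$ and $\log_d \dim\mathcal{C} \ge k$, where $n > 0$ is an integer while $k$, $0 \le k \le n$, is a real number. The supremum of nonnegative numbers $R$ satisfying

$$\limsup_{n\to\infty} F^{\star}_{n,Rn}(\mathcal{A}^{\otimes n}|T_n) = 1$$

is called the conditional quantum capacity of $\mathcal{A}$ on $\{T_n\}$ and denoted by $\mathsf{C}(\mathcal{A}|\{T_n\})$. ◇

Comparing this with Definition 1, we see $\mathsf{C}(\mathcal{A}) = \mathsf{C}(\mathcal{A}|\{T_n\})$ when we put no restriction on coding schemes, i.e., when $T_n$ is the set of all subspaces of $\mathsf{H}^{\otimes n}$ for each $n > 0$.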

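We have an upper bound on the conditional capacity, a proof of which is given in Appendix E.

Lemma 5: Let a sequence of code classes $\{T_n\}$ be given. Then,

$$\mathsf{C}(\mathcal{A}|\{T_n\}) \le \limsup_{n\to\infty}\sup_{\mathcal{C}\in T_n}\frac{I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big)}{n}, \tag{33}$$

where $\Pi_{\mathcal{C}}$ is the projection onto $\mathcal{C}$. ◇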
B. Conditional Capacity of the Depolarizing Channel on Stabilizer Codes

In this subsection, we will see that the lower bound on the capacity obtained in the previous section is, in fact, a lower bound on the conditional capacity $\mathsf{C}(\mathcal{A}|\{S_n\})$ of the depolarizing channel, where $S_n$ is the set of all symplectic stabilizer codes. To be precise, we put

$$S_n = \bigcup_{N} S_n(N),$$

where S„(N) is defined in Definition ^ and N ranges over all L(H) basis of the form N = {Nu}uex with 
d^) and @ for some basis {|0), . . . ,\d — 1)} of H and some primitive d-th root of unity uj. We call an 
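$N$-channel $\{\sqrt{P(u)}N_u\}_{u\in\mathcal{X}}$ satisfying

$$P(u) = \begin{cases} p/(d^2-1) & \text{if } u \ne (0,0) \\ 1-p & \text{if } u = (0,0) \end{cases} \tag{34}$$

for some $0 \le p \le 1$ a ($d$-dimensional) $p$-depolarizing channel.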
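As a small numerical illustration of definition (34), the following Python sketch builds the distribution $P$ of a $d$-dimensional $p$-depolarizing channel and evaluates $1 - H(P)$ with logarithms to base $d$, that is, the quantity $[k - H(P_L|\overline{P_L})]/n$ in the case $n = k = 1$ discussed in the Remark of Section V-A. The function names are illustrative assumptions and do not appear in the paper.

```python
import math

def depolarizing_distribution(d, p):
    """Return P on X = F_d x F_d as a dict, following (34)."""
    return {(a, b): (1.0 - p) if (a, b) == (0, 0) else p / (d * d - 1)
            for a in range(d) for b in range(d)}

def entropy(dist, base):
    """Shannon entropy of a dict of probabilities, with 0 log 0 = 0."""
    return -sum(q * math.log(q, base) for q in dist.values() if q > 0)

def one_shot_rate(d, p):
    """1 - H(P) in base-d units (the n = k = 1 case of Corollary 1)."""
    return 1.0 - entropy(depolarizing_distribution(d, p), d)

if __name__ == "__main__":
    for d, p in [(2, 0.10), (2, 0.19), (3, 0.25), (3, 0.26)]:
        print(d, p, one_shot_rate(d, p))
```

For $d = 2$ the value returned by `one_shot_rate` coincides with the expression $1 - h(p) - p\log_2 3$ quoted in the Introduction.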
Then, we have the next. 

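Theorem 3: For the $p$-depolarizing channel $\mathcal{A}$ with $0 \le p \le 1$ and for the present choice of $S_n$, i.e., for $S_n = \bigcup_N S_n(N)$, we have

$$\mathsf{C}(\mathcal{A}|\{S_n\}) = \sup_{n\ge 1}\max_{\mathcal{C}\in S_n}\frac{I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big)}{n}
= \lim_{n\to\infty}\max_{\mathcal{C}\in S_n}\frac{I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big)}{n}
= \lim_{n\to\infty}\max_{L}\frac{k - H(P_L|\overline{P_L})}{n},$$

where the maximum with respect to $L$ is over all $L \subseteq F^{2n}$ with $\dim L = n-k$, $1 \le k \le n$, and $L \subseteq L^{\perp}$. Here, the probability distribution $P_L$ is defined by (28) with (34). ◇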
To prove this, we use the next symmetry property of the depolarizing channel: For this channel, the representation $\{\sqrt{1-p}\,I\} \cup \{\sqrt{p/(d^2-1)}\,N_u\}_{u\in\mathcal{X}\setminus\{(0,0)\}}$ does not depend on the choice of the basis $\{|0\rangle,\dots,|d-1\rangle\}$ and $\omega$, which determine $N = \{N_u\}_{u\in\mathcal{X}}$. Because of this property, we will obtain Theorem 3 if we show the next lemma.

Lemma 6: Let a basis N = {Nu}u(^x be given through (|^) and (^. For an N-channel A ~ 

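$\{\sqrt{P(u)}N_u\}_{u\in\mathcal{X}}$, we have

$$\mathsf{C}(\mathcal{A}|\{S_n(N)\}) = \sup_{n\ge 1}\max_{\mathcal{C}\in S_n(N)}\frac{I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big)}{n}
= \lim_{n\to\infty}\max_{\mathcal{C}\in S_n(N)}\frac{I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big)}{n}
= \lim_{n\to\infty}\max_{L}\frac{k - H(P_L|\overline{P_L})}{n},$$

where the maximum with respect to $L$ is over all $L \subseteq F^{2n}$ with $\dim L = n-k$, $1 \le k \le n$, and $L \subseteq L^{\perp}$. Here, the probability distribution $P_L$ is given by (28). ◇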

A proof of this lemma is given in Appendix F. From the proof, it is clear that Theorem 3 remains true even if we extend $S_n = \bigcup_N S_n(N)$ so that it includes the symplectic codes designed with $N'_n = \{N^{(1)}_{x_1} \otimes \cdots \otimes N^{(n)}_{x_n} \mid (x_1,\dots,x_n) \in \mathcal{X}^n\}$ instead of $N_n$, where each $\{N^{(i)}_u\}$ is defined as $N$ with some basis $\{|0\rangle,\dots,|d-1\rangle\}$ and some $\omega$, which may vary according to $i$.
C. Superadditivity of Coherent Information

The conditional capacity in Theorem 3 is the limit of $c_n/n$, where

$$c_n = \sup_{\mathcal{C}\in S_n} I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big),\quad n = 1, 2, \dots. \tag{35}$$

A natural question is whether $\lim_n c_n/n > c_1$ or not. Shor and Smolin numerically demonstrated that $\lim_n c_n/n > c_1$ for very noisy 2-dimensional depolarizing channels, which showed the remarkable feature of $c_n$, or its counterpart $c'_n$ that is defined by (35) with $S_n$ replaced by the set of all subspaces of $\mathsf{H}^{\otimes n}$. Note that $\lim_n c'_n/n$ is an upper bound on the unconditional capacity $\mathsf{C}(\mathcal{A})$ by Lemma 5. For the erasure channel, $\lim_n c'_n/n$ is known to equal $c'_1$, which is indeed the capacity.



Here this paper reports that superadditivity of $c_n$ has been observed for very noisy 3-dimensional $p$-depolarizing channels. Specifically, a numerical evaluation using the repetition code $\mathrm{span}\{1100000, 1010000, \dots, 1000001\}$ as an inner code, where $1100000 \in \mathcal{X}^7$ denotes the vector $(1, 0, 1, 0, 0, 0, \dots, 0, 0) \in F_3^{14}$ and so on, has shown that for $0.2552 \le p \le 0.2557$, $c_7 > 0$ while $c_1 \le 0$.
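The kind of numerical evaluation referred to above can be sketched for the simpler qubit case ($d = 2$) by brute-force enumeration. The Python sketch below computes $[k - H(P_L|\overline{P_L})]/n$ for an $n$-qubit repetition code with $k = 1$ on the $p$-depolarizing channel. It is an illustration under stated assumptions, not the computation performed in the paper: the stabilizer is taken to be generated by $Z_1Z_j$ (by the symmetry of the depolarizing channel this should give the same value as an $X$-type repetition code), the labels `s` and `u` are one possible choice of coset labels, and enumeration over all $4^n$ errors is only practical for small $n$.

```python
import math
from collections import defaultdict
from itertools import product

def repetition_code_rate(n, p):
    """[k - H(P_L | marginal)]/n for an n-qubit repetition code (k = 1, d = 2).

    Each single-qubit error is a pair (a, b): a is the X-part, b the Z-part.
    Assumption of this sketch: the stabilizer is generated by Z_1 Z_j, j = 2..n.
    """
    single = {(0, 0): 1.0 - p, (1, 0): p / 3, (0, 1): p / 3, (1, 1): p / 3}
    joint = defaultdict(float)          # plays the role of P_L(s, u)
    for err in product(single, repeat=n):
        prob = math.prod(single[e] for e in err)
        a = [e[0] for e in err]
        b = [e[1] for e in err]
        s = tuple((a[0] + a[j]) % 2 for j in range(1, n))  # syndrome
        u = (a[0], sum(b) % 2)          # coset of L inside the syndrome class
        joint[(s, u)] += prob
    marginal = defaultdict(float)       # marginal of P_L on syndromes
    for (s, _), q in joint.items():
        marginal[s] += q

    def h(dist):
        return -sum(q * math.log2(q) for q in dist.values() if q > 0)

    # H(P_L | marginal) = H(joint) - H(marginal); k = 1 for this code.
    return (1.0 - (h(joint) - h(marginal))) / n
```

For $n = 1$ the function reduces to $1 - H(P)$. Comparing `repetition_code_rate(5, p)` with `repetition_code_rate(1, p)` for $p$ near the point where the latter vanishes is one way to probe the effect reported by Shor and Smolin; no particular numerical outcome is asserted here.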

VII. Bounds for General Discrete Channels

We remark that this work's bound holds true for general discrete memoryless channels (TPCP maps) as treated in [11]. Namely, if we associate the probability distribution $P = P_{\mathcal{A}}$ with a channel $\mathcal{A}$ [or $P = P_{\mathcal{U}\mathcal{A}}$ with some TPCP map $\mathcal{U}$ on $\mathsf{L}(\mathsf{H})$] as in [11, Section II], then the bound in (32) and that in Corollary 1 are true for this channel. This can be shown in a quite similar way to that in [11]. That is, if the minimum fidelity $F$ is replaced by the minimum average fidelity $F_a$ introduced in [11], the same bound holds on $F_a$ for general memoryless channels. Then, owing to the fact [11] that a lower bound on $F_a$ gives asymptotically the same bound on $F$, we obtain the lower bound on the minimum fidelity $F$ of the best codes used on general channels. These bounds also apply to 'blockwise' memoryless channels (TPCP maps) $\mathcal{A}_n$ on $\mathsf{L}(\mathsf{H}^{\otimes n})$ if we associate the probability distribution $P_n = P_{\mathcal{A}_n}$ on $\mathcal{X}^n$ [or $P_n = P_{\mathcal{U}\mathcal{A}_n}$ with some TPCP map $\mathcal{U}$ on $\mathsf{L}(\mathsf{H}^{\otimes n})$] with $\mathcal{A}_n$ as in [11, Section V, Definition 2] and use $P_n$ in place of $P^n$.

VIII. Concluding Remarks

This paper has presented a lower bound on the quantum capacity which has a close relation to the known upper bound based on coherent information. This author conjectures that this bound is actually the conditional capacity of general Pauli or $N$-channels on all symplectic stabilizer codes. It might even be true that the lower bound is tight as one on the usual (unconditional) quantum capacity for Pauli channels, which would be proved by showing that the maximum of coherent information is nearly achieved by an input state proportional to the projection onto the code space of a symplectic stabilizer code for large enough $n$. If the quantum capacity were proven to be the coherent-information upper bound $\lim_n c'_n/n$, it would still leave room for investigation since the bound is a limiting expression and we do not know how to calculate it except for a few cases [52].

In the previous work [10], this author conjectured that the exponent appearing in the fidelity bound therein is not the optimum for some channels. Now this fact has been established at least numerically since the bound in [10] is the same as the right-hand side of (32) when $n = k = 1$, in which case $P_L = P$, and we have Shor and Smolin's numerical evaluation for the depolarizing channel, from which it follows that there exist some $n \ge 3$ and relatively large $p$ such that $E_{n,1}(Rn, P_L)$ is positive while $E_{1,1}(R, P)$ vanishes. The problem of determining the quantity on the left-hand side of (32) would deserve investigation in view of the great attention paid to the corresponding problem in classical information theory; an improvement on $E_{1,1}(R, P)$ for $P$ with large $P((0, 0))$ can be found in the literature.
Appendices
I. Appendix A: Proof of Lemma 1

In this proof, we use the property of a nondegenerate bilinear form $(\cdot,\cdot)'$ on $V$ with $\dim V = 2l$ that

$$\dim W + \dim W^{\perp'} = 2l \tag{36}$$

for any subspace $W \subseteq V$, where $W^{\perp'} = \{y \in V \mid \forall x \in W,\ (x, y)' = 0\}$. We construct pairs $(g_i, h_i)$ with the desired property in the following two-stage algorithmic process.

(i) Put $m = n-k$, $V = F^{2n}$ and $W = L$. Repeat Procedure 1.

Procedure 1. If $m = 0$, then go to (ii). Since $W$ is contained in $W^{\perp} \cap V$ and $0 < \dim W^{\perp} \cap V = 2k + m < \dim V = 2(k+m)$ by (36), where in this case the bilinear form $(\cdot,\cdot)'$ is simply the restriction of $(\cdot,\cdot)_{\rm sp}$ to $V$ (see also Proposition 2.9 of the cited reference), there is a vector $h_m \in V \setminus W^{\perp}$ with $(g_m, h_m)_{\rm sp} = 1$. Define a subspace $V' \subseteq F^{2n}$ by $V = V' \perp \mathrm{span}\{g_m, h_m\}$, replace $V$ with $V'$, and replace $W$ with $\mathrm{span}\{g_1,\dots,g_{m-1}\}$. Decrease $m$ by 1.

Up to now, we have hyperbolic pairs $(g_1, h_1),\dots,(g_{n-k}, h_{n-k})$.

(ii) Put $m = n-k$. Repeat Procedure 2.

Procedure 2. If $m = n$, terminate this procedure. Let $V$ be defined by $F^{2n} = V \perp \mathrm{span}\{g_1, h_1,\dots,g_m, h_m\}$. Choose an arbitrary nonzero vector $g_{m+1} \in V$. Since $W = \mathrm{span}\{g_{m+1}\}$ is contained in $W^{\perp} \cap V$ and $\dim W^{\perp} \cap V = 2(n-m) - 1 < \dim V = 2(n-m)$, there is a vector $h_{m+1} \in V \setminus W^{\perp}$ with $(g_{m+1}, h_{m+1})_{\rm sp} = 1$. Increase $m$ by 1.

Thus, we have hyperbolic pairs $(g_1, h_1),\dots,(g_n, h_n)$.
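The two-stage process above can be illustrated by a small brute-force search over $F_2^{2n}$. The following Python sketch is not the dimension-counting argument of the proof: it simply enumerates all vectors (feasible only for small $n$), it assumes $d = 2$ and interleaved coordinates $(x_1, z_1, \dots, x_n, z_n)$, and in stage (i) it imposes directly that each $h_m$ pairs to 1 with $g_m$ and to 0 with all other $g_i$ and all earlier $h_j$. The function names are hypothetical.

```python
from itertools import product

def sp(x, y):
    """Symplectic form on F_2^{2n}, coordinates interleaved as (x1, z1, ..., xn, zn)."""
    return sum(x[2 * i] * y[2 * i + 1] + x[2 * i + 1] * y[2 * i]
               for i in range(len(x) // 2)) % 2

def hyperbolic_basis(gs, n):
    """Extend mutually orthogonal independent vectors gs to n hyperbolic pairs."""
    vectors = list(product((0, 1), repeat=2 * n))
    gs, hs = list(gs), []
    # Stage (i): a partner h_m for each given g_m.
    for m in range(len(gs)):
        h = next(v for v in vectors
                 if all(sp(g, v) == (1 if i == m else 0) for i, g in enumerate(gs))
                 and all(sp(h_prev, v) == 0 for h_prev in hs))
        hs.append(h)
    # Stage (ii): add new pairs inside the orthogonal complement of those found.
    while len(gs) != n:
        comp = [v for v in vectors
                if any(v) and all(sp(w, v) == 0 for w in gs + hs)]
        g = comp[0]
        h = next(v for v in comp if sp(g, v) == 1)
        gs.append(g)
        hs.append(h)
    return gs, hs
```

For example, with $n = 3$ and `gs = [(1,0,1,0,0,0), (1,0,0,0,1,0)]` (a basis of a three-qubit repetition-type code in these assumed coordinates), the call `hyperbolic_basis(gs, 3)` returns three pairs satisfying $(g_i, h_j)_{\rm sp} = \delta_{ij}$.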

II. Appendix B: Proof of Lemma 3

In this proof, when x E F^" is of the form x = ^"^^ Zihi, we write A^<^> = YTi^iiNh,)^' where the 
product on the left-hand side is unambiguous because {Nh-Y% i = 1, . . . ,n, commute with each other. 
Similarly, when x E F^" is of the form x = Yl^=i'^i9i^ write N^^'^ = YYi=ii^9i)^^ ■ -Due to (^, A^^^ 
and M = A^<^''"«^i+"-'=)A^(^!"i9i+"-fc)A^('^)A^W^ where i runs from 1 to in the summations, are the same 
up to a factor of modulus one that solely depends on x, so that N^pN^ = MpM'^ . Hence, if and 
Zu' differ only by phase factors from A^(^»"»^i+n-fc) and A^<^»"iS«+"-fe), respectively, and the factors do 
not depend on (&j)i<i<fc when they act on \zi, . . . , 2„_fc, hi, ... , h^), then we will obtain the lemma. 

It is seen from (p!6| ) that the action of X„ is the same as A^^^^'^^'^^+n-fc). On the other hand, the action 
of Zu' is the same as up to an irrelevant phase factor. This can be seen by the following 

chain of equalities, where (|^, (|T^ and (|TB|) are used, and in all summations, i runs from 1 to k, and 
j from 1 to n — k, and A, A', which depend on x and hi, are defined by 

IVfi'SjZjhj+Sibihi + r.-k) = \]\T , , ^ , , 

-'^ ^-^^^jZjhj+T,ibthi+„_t. 

and 

■^1 ^iiJi-^n — k ' 
which is possible owing to



N'^^'<3^+-^^\z,,...,Zn-k,b,,...,bk) 
= ]\[{^i^i9i+n-k) ]\J'('^jZjhj+'£ibihi+ri-k) |g 



00^ 



= ^j'^i''^'i^i^'^i''^'it^i+n-k J\j{'SjZjhj+T,ibihi + „-i:) 

= u;''^<'^+''^<^^+"-''\z^,...,Zn-k,h,...,bk). 

III. Appendix C: Proof of Theorem 2

In this appendix, the lower bound in Theorem 2 will be established using the concatenated code in Section IV as well as the notation therein. In the proof, the random coding proof method is employed, where as in DiVincenzo, Shor and Smolin, an inner code $L$ having parameters $k$ and $n$, $1 \le k \le n$, is fixed and the average fidelity over all possible outer codes $L_{\rm out}$ is evaluated. Namely, we evaluate

$$\frac{1}{|\mathsf{A}|}\sum_{L_{\rm out}\in\mathsf{A}} F\big(\mathcal{C}(L_{\rm out}), \mathcal{R}\mathcal{A}^{\otimes n\nu}\big), \tag{37}$$

where $F$ is the minimum fidelity, the ensemble $\mathsf{A}$ is specified below in (45), $\mathcal{C}(L_{\rm out})$ is one of the $d^{n\nu-\kappa}$ symplectic quantum codes of dimension $d^{\kappa}$ associated with $\mathrm{cat}(L, L_{\rm out})$, and $\mathcal{R}$ is determined from $J_0$, which will be given below. It will be helpful to notice that $\mathrm{span}\,G$ is contained in $\mathrm{cat}(L, L_{\rm out})$, which, in turn, is contained in $\mathrm{cat}(L, L_{\rm out})^{\perp}$, so that any coset of $\mathrm{cat}(L, L_{\rm out})^{\perp}$ in $F^{2n\nu}$ is a union of some cosets of $\mathrm{span}\,G$. We will work largely with cosets of $\mathrm{span}\,G$ rather than individual sequences in $F^{2n\nu}$ because, due to Lemma 3, error operators indexed by sequences in a fixed coset act on the states in exactly the same way.

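It is convenient to view $y \in F^{2n\nu}$ in terms of the basis $\{g_i^{(j)}, h_i^{(j)}\}_{1\le i\le n,\ 1\le j\le \nu}$. Let us expand

$$y = (y_{1,1},\dots,y_{1,2n},\dots,y_{\nu,1},\dots,y_{\nu,2n}) \in F^{2n\nu}$$

as

$$y = \sum_{j=1}^{\nu}\sum_{i=1}^{n} w_{j,i}\,g_i^{(j)} + z_{j,i}\,h_i^{(j)} \tag{38}$$

and consider the transformation that maps $y$ to

$$y' = (w_{1,1}, z_{1,1},\dots,w_{1,n}, z_{1,n},\dots,w_{\nu,1}, z_{\nu,1},\dots,w_{\nu,n}, z_{\nu,n}). \tag{39}$$

Then, it is easy to see that each block of length $2n$ undergoes the transformation $\gamma$:

$$(w_{j,1}, z_{j,1},\dots,w_{j,n}, z_{j,n}) = \gamma((y_{j,1},\dots,y_{j,2n})),\quad j = 1,\dots,\nu.$$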
Note that the vectors $y \in F^{2n\nu}$ which, when expanded as in (38), have the same $z_{j,i}$ for $j = 1,\dots,\nu$, $i = 1,\dots,n$, and the same $w_{j,i}$ for $j = 1,\dots,\nu$, $i = n-k+1,\dots,n$, form a coset of $\mathrm{span}\,G$ in $F^{2n\nu}$. In other words, the set

$$\{y \mid (g_i^{(j)}, y)_{\rm sp} = z_{j,i},\ 1\le j\le\nu,\ 1\le i\le n;\ (y, h_i^{(j)})_{\rm sp} = w_{j,i},\ 1\le j\le\nu,\ n-k+1\le i\le n\}$$

for a fixed pair $((z_{j,i}), (w_{j,i})) \in (F^{n})^{\nu} \times (F^{k})^{\nu}$ is a coset of $\mathrm{span}\,G$. With the decomposition of an error operator in Lemma 3 in mind, we rather write a coset of $\mathrm{span}\,G$ as

$$\{y \mid z(y) = z,\ v(y) = v\}$$

with a fixed pair $(z, v) \in (F^{n-k})^{\nu} \times (F^{2k})^{\nu}$, where the two sequences $z(y)$ and $v(y)$ are defined by

$$z(y) = (z_1,\dots,z_\nu) \in (F^{n-k})^{\nu} \quad\text{and}\quad v(y) = (v_1,\dots,v_\nu) \in (F^{2k})^{\nu}, \tag{40}$$

and

$$z_j = (z_{j,1},\dots,z_{j,n-k}) \quad\text{and}\quad v_j = (w_{j,n-k+1}, z_{j,n-k+1},\dots,w_{j,n}, z_{j,n}),\quad j = 1,\dots,\nu, \tag{41}$$

with (38). We denote by $\Gamma(z(y), v(y))$ the coset of $\mathrm{span}\,G$ in $F^{2n\nu}$ that contains $y$ for any $y \in F^{2n\nu}$. The simplest coset representative of a coset $\Gamma(z, v)$ is the vector $y$ with $w_{j,i} = 0$ for $1\le j\le\nu$, $1\le i\le n-k$ when represented as in (38). The set (transversal) consisting of these coset representatives is denoted by $Y_G$. On the other hand, cosets of $\mathrm{cat}(L, L_{\rm out})^{\perp}$ in $F^{2n\nu}$ can be specified as follows: A coset of $\mathrm{cat}(L, L_{\rm out})^{\perp}$ has the form

$$\{y \mid (g_i^{(j)}, y)_{\rm sp} = z_{j,i},\ 1\le j\le\nu,\ 1\le i\le n-k;\ (\tilde{g}'_i, y)_{\rm sp} = \sigma_i,\ 1\le i\le k\nu-\kappa\}$$
$$= \{y \mid z(y) = z;\ (g'_i, v(y))_{\rm sp} = \sigma_i,\ 1\le i\le k\nu-\kappa\} \tag{42}$$

with fixed $z = ((z_{j,i})_{1\le i\le n-k})_{1\le j\le\nu}$ and $\sigma = (\sigma_i)_{1\le i\le k\nu-\kappa}$. We denote this coset by $\Lambda(z, \sigma)$.

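Given $z = (z_1,\dots,z_\nu) \in (F^{n-k})^{\nu}$ and $v = (v_1,\dots,v_\nu) \in (F^{2k})^{\nu}$, we denote the rearranged sequence $((z_1, v_1),\dots,(z_\nu, v_\nu)) \in (F^{n-k} \times F^{2k})^{\nu}$ by $[z, v]$, and define a probability distribution $P_{z,v}$, which is called the type of the sequence $[z, v]$, by

$$P_{z,v}(s, u) = \frac{|\{j \mid 1\le j\le\nu,\ (z_j, v_j) = (s, u)\}|}{\nu},\quad s \in F^{n-k},\ u \in F^{2k}, \tag{43}$$

and put

$$P_z(s) = \sum_{u\in F^{2k}} P_{z,v}(s, u),\ s \in F^{n-k},\qquad P_v(u) = \sum_{s\in F^{n-k}} P_{z,v}(s, u),\ u \in F^{2k},$$

which are the types of $z$ and $v$, respectively.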
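To make the notion of a type and of the conditional entropy $H(P_{z,v}|P_z)$ used by the decoder concrete, here is a minimal Python sketch. It treats $z$ and $v$ as plain Python sequences of hashable symbols; the function names, including the helper `min_entropy_choice`, are hypothetical and only mimic the selection rule described in the next paragraph.

```python
import math
from collections import Counter

def joint_type(z, v):
    """Type P_{z,v} of the paired sequence [z, v], as in (43)."""
    nu = len(z)
    return {pair: count / nu for pair, count in Counter(zip(z, v)).items()}

def conditional_entropy(joint, base=2):
    """H(Q | marginal of Q on the first coordinate), in units of log base."""
    marginal = Counter()
    for (s, _), q in joint.items():
        marginal[s] += q
    return -sum(q * math.log(q / marginal[s], base)
                for (s, _), q in joint.items())

def min_entropy_choice(z, candidates):
    """Pick the candidate v whose type has the least conditional entropy given z."""
    return min(candidates, key=lambda v: conditional_entropy(joint_type(z, v)))
```

For instance, with `z = ['a', 'a', 'b', 'b']`, the candidate `[0, 0, 1, 1]` is a function of `z` and has conditional entropy 0, so it is preferred over `[0, 1, 0, 0]`, whose conditional entropy is 0.5 bit.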

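To make use of Lemma 2, we choose a representative from each coset $\Lambda(z, \sigma)$ as follows. Among those sequences $y$ that belong to both $Y_G$ and $\Lambda(z, \sigma)$, we choose one that minimizes $H_c(P_{z,v(y)}) = H(P_{z,v(y)}|P_z)$, where $H_c(Q)$ is shorthand for $H(Q|\overline{Q})$. We apply Lemma 2 defining $J_0 = J_0(L_{\rm out})$ as the set of these representatives. Denote $J$ in the lemma by $J(L_{\rm out})$, viz.,

$$J(L_{\rm out}) = J_0(L_{\rm out}) + \mathrm{cat}(L, L_{\rm out}), \tag{44}$$

and put

$$\mathsf{A} = \{L_{\rm out} \subseteq F^{2k\nu} \mid L_{\rm out}\ \text{linear},\ L_{\rm out} \subseteq L_{\rm out}^{\perp},\ \dim L_{\rm out} = k\nu - \kappa\}. \tag{45}$$

Then, writing $\bar{F}$ for the average fidelity in (37), we have

$$1 - \bar{F} \le \frac{1}{|\mathsf{A}|}\sum_{L_{\rm out}\in\mathsf{A}}\ \sum_{y\in F^{2n\nu}} P^{n\nu}(y)\,1[y \notin J(L_{\rm out})] = \sum_{y\in F^{2n\nu}} P^{n\nu}(y)\,\frac{|\mathsf{B}(y)|}{|\mathsf{A}|}, \tag{46}$$

where $1[T] = 1$ if a statement $T$ is true and $1[T] = 0$ otherwise, and

$$\mathsf{B}(y) = \{L_{\rm out} \in \mathsf{A} \mid y \notin J(L_{\rm out})\},\quad y \in F^{2n\nu}.$$

The fraction $|\mathsf{B}(y)|/|\mathsf{A}|$ is trivially bounded as

$$\frac{|\mathsf{B}(y)|}{|\mathsf{A}|} \le 1,\quad y \in F^{2n\nu}. \tag{47}$$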



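We use the next lemma [11], which is a variant of a fact established by Calderbank et al.

Lemma 7: Let

$$\mathsf{A}(x) = \{L_{\rm out} \in \mathsf{A} \mid x \in L_{\rm out}^{\perp} \setminus \{0\}\}.$$

Then, $|\mathsf{A}(0)| = 0$ and

$$\frac{|\mathsf{A}(x)|}{|\mathsf{A}|} = \frac{d^{k\nu+\kappa} - 1}{d^{2k\nu} - 1} \le \frac{1}{d^{k\nu-\kappa}},\quad x \in F^{2k\nu},\ x \ne 0.$$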

From the design of $J_0(L_{\rm out})$ specified above, it follows that

$$\mathsf{B}(y) \subseteq \{L_{\rm out} \in \mathsf{A} \mid \exists v' \in (F^{2k})^{\nu},\ H_c(P_{z(y),v'}) \le H_c(P_{z(y),v(y)}),\ v' - v(y) \in L_{\rm out}^{\perp} \setminus \{0\}\}. \tag{48}$$

[This can be seen as follows. First, we consider the case where $y \in Y_G$. Recall that $\Lambda(z, \sigma)$ was defined by (42), and observe $(g'_i, v(y))_{\rm sp} = (\tilde{g}'_i, \widetilde{v(y)})_{\rm sp} = (\tilde{g}'_i, y)_{\rm sp}$ from (30), where again $v = v(y)$ and $z = z(y)$ are specified by (38), (40) and (41). Hence, $x \in Y_G$ and $y \in Y_G$ are in the same coset of $\mathrm{cat}(L, L_{\rm out})^{\perp}$ if and only if

$$z(x) = z(y) \quad\text{and}\quad (g'_i, v(x))_{\rm sp} = (g'_i, v(y))_{\rm sp},\ 1\le i\le k\nu-\kappa$$

by (42), which can be restated as

$$z(x) = z(y) \quad\text{and}\quad v(x) - v(y) \in L_{\rm out}^{\perp}.$$

Since, at the beginning of the paragraph containing (44), we have chosen a coset representative that minimizes $H_c$ in $\Lambda(z, \sigma) \cap Y_G$ for each coset $\Lambda(z, \sigma)$, it follows that for any $y \in \Lambda(z(y), \sigma) \cap Y_G$, the condition $y \notin J(L_{\rm out})$ occurs only if there exists a vector other than $y$ in $\Lambda(z(y), \sigma) \cap Y_G$ that has conditional entropy as small as that of $y$, which implies (48). To see this for a general $y \in F^{2n\nu}$, note that for a fixed coset of $\mathrm{cat}(L, L_{\rm out})$, either each element $y$ of the coset satisfies $L_{\rm out} \in \mathsf{B}(y)$ or each satisfies $L_{\rm out} \notin \mathsf{B}(y)$ because of Lemma 2, or specifically, (44) in this case. In particular, $L_{\rm out} \in \mathsf{B}(y)$ if and only if $L_{\rm out} \in \mathsf{B}(\hat{y})$ for the vector $\hat{y} \in Y_G$ with $y - \hat{y} \in \mathrm{span}\,G$, so that we can judge whether $y \notin J(L_{\rm out})$ occurs or not by checking the condition $\hat{y} \notin J(L_{\rm out})$ for $\hat{y} \in Y_G$ with $y - \hat{y} \in \mathrm{span}\,G$.]
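Owing to (48), we have

$$|\mathsf{B}(y)| \le \sum_{v'\in(F^{2k})^{\nu}:\ H_c(P_{z(y),v'}) \le H_c(P_{z(y),v(y)})} |\mathsf{A}(v' - v(y))| \le \sum_{v'\in(F^{2k})^{\nu}:\ H_c(P_{z(y),v'}) \le H_c(P_{z(y),v(y)})} |\mathsf{A}|\,d^{-(k\nu-\kappa)}, \tag{49}$$

where the second inequality is due to Lemma 7. Then, from (46), (47) and (49), it follows that

$$1 - \bar{F} \le \sum_{z\in(F^{n-k})^{\nu}}\ \sum_{v\in(F^{2k})^{\nu}}\ \sum_{y\in\Gamma(z,v)} P^{n\nu}(y)\,\min\Big\{\sum_{v'\in(F^{2k})^{\nu}:\ H_c(P_{z,v'}) \le H_c(P_{z,v})} d^{-(k\nu-\kappa)},\ 1\Big\}.$$

Recalling the probability distribution $P_L$ defined by (28) and the transformation that converts $y$ into $y'$ in (39), we have

$$\sum_{y\in\Gamma(z,v)} P^{n\nu}(y) = P_L^{\nu}([z, v]),$$

where $P_L^{\nu}$ denotes the product of $\nu$ copies of $P_L$, and hence, the above bound can be rewritten as

$$1 - \bar{F} \le \sum_{z\in(F^{n-k})^{\nu}}\ \sum_{v\in(F^{2k})^{\nu}} P_L^{\nu}([z, v])\,\min\Big\{\sum_{v'\in(F^{2k})^{\nu}:\ H_c(P_{z,v'}) \le H_c(P_{z,v})} d^{-(k\nu-\kappa)},\ 1\Big\}. \tag{50}$$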

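Now, we will go into an argument using the method of types. We put

$$\mathcal{P}_\nu = \mathcal{P}_\nu(F^{n-k}) = \{P_z \mid z \in (F^{n-k})^{\nu}\}.$$

For a type $Q \in \mathcal{P}_\nu$, we define a set of stochastic matrices $\mathcal{W}_\nu(Q)$ by

$$\mathcal{W}_\nu(Q) = \{V \mid \exists z \in (F^{n-k})^{\nu},\ \exists v \in (F^{2k})^{\nu},\ P_z = Q \text{ and } P_{v|z} = V\},$$

where $P_{v|z}(u|s) = P_{z,v}(s, u)/P_z(s)$ denotes the conditional type of $v$ given $z$, and put

$$W_\nu = \max_{Q\in\mathcal{P}_\nu} |\mathcal{W}_\nu(Q)|.$$

Here, the probability distribution $V(\cdot|s)$ is allowed to be undefined for some (but not all) $s \in F^{n-k}$; the equality between stochastic matrices $V$ and $V'$ means $V(u|s) = V'(u|s)$ for all $u$, $s$ for which either $V(u|s)$ or $V'(u|s)$ is defined. For a type $Q \in \mathcal{P}_\nu$ of a sequence in $(F^{n-k})^{\nu}$ and $V \in \mathcal{W}_\nu(Q)$, a probability distribution $Q \times V$ on $F^{n-k} \times F^{2k}$ is defined by

$$[Q \times V]((s, u)) = Q(s)V(u|s),\quad s \in F^{n-k},\ u \in F^{2k},$$

which is the type of a sequence in $(F^{n-k} \times F^{2k})^{\nu}$. Here we understand $[Q \times V]((s, u)) = 0$ for $s$ with $Q(s) = 0$. The set of all possible types $Q \times V$ of sequences $[z, v] \in (F^{n-k} \times F^{2k})^{\nu}$ is denoted by

$$\mathcal{P}_\nu(F^{n-k} \times F^{2k}).$$

For $Q \in \mathcal{P}_\nu$ and $V \in \mathcal{W}_\nu(Q)$, i.e., for $Q \times V \in \mathcal{P}_\nu(F^{n-k} \times F^{2k})$, a set of sequences $\mathcal{T}_{Q\times V}$ is defined by

$$\mathcal{T}_{Q\times V} = \{[z, v] \in (F^{n-k} \times F^{2k})^{\nu} \mid P_{z,v} = Q \times V\}.$$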
Hereafter, we write $P_L(s, v)$ and $[Q \times V](s, v)$ in place of $P_L((s, v))$ and $[Q \times V]((s, v))$, respectively. For a fixed sequence $z \in (F^{n-k})^{\nu}$ and a stochastic matrix $V \in \mathcal{W}_\nu(P_z)$, we define

$$\mathcal{T}_V(z) = \{v \in (F^{2k})^{\nu} \mid P_{v|z} = V\},$$

which is called the $V$-shell of $z$. Clearly, the cardinality of $\mathcal{T}_V(z)$ is uniform over sequences $z$ of a fixed type $Q$, and hence, we can put

$$T_V(Q) = |\mathcal{T}_V(z)|,$$

where $P_z = Q$. We use the following two basic estimates (Lemmas 2.5 and 2.6, and Eqs. (II.5) and (II.7), of the cited references on the method of types):

$$\Pr\{P_{z,v} = Q \times V\} = |\mathcal{T}_{Q\times V}| \prod_{(s,u)\in F^{n-k}\times F^{2k}} P_L(s, u)^{\nu[Q\times V](s,u)} \le \exp_d[-\nu D(Q \times V\|P_L)], \tag{51}$$

where $Q \in \mathcal{P}_\nu$, $V \in \mathcal{W}_\nu(Q)$ and the sequence of random variables $[z, v]$ that takes values in $(F^{n-k} \times F^{2k})^{\nu}$ is drawn according to $P_L^{\nu}$;

$$T_V(Q) \le \exp_d[\nu H(V|Q)],\quad Q \in \mathcal{P}_\nu,\ V \in \mathcal{W}_\nu(Q). \tag{52}$$
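We arbitrarily fix $R$, $0 \le R \le 1$, and put

$$\kappa = \lceil kR\nu \rceil,$$

so that the information rate $\kappa/(n\nu)$ of the concatenated code is not less than $kR/n$. From (50), (51), (52) and the inequality $\min\{a + b, 1\} \le \min\{a, 1\} + \min\{b, 1\}$ for $a, b \ge 0$, we have

$$\begin{aligned}
1 - \bar{F} &\le \sum_{Q\in\mathcal{P}_\nu}\ \sum_{V\in\mathcal{W}_\nu(Q)}\ \sum_{[z,v]\in\mathcal{T}_{Q\times V}} P_L^{\nu}([z, v])\,\min\Big\{\sum_{V'\in\mathcal{W}_\nu(Q):\ H(V'|Q)\le H(V|Q)} \frac{T_{V'}(Q)}{d^{k\nu-\kappa}},\ 1\Big\} \\
&\le d \sum_{Q\in\mathcal{P}_\nu}\ \sum_{V\in\mathcal{W}_\nu(Q)} \exp_d[-\nu D(Q \times V\|P_L)] \sum_{V'\in\mathcal{W}_\nu(Q):\ H(V'|Q)\le H(V|Q)} \exp_d\big[-\nu|k - kR - H(V'|Q)|^+\big] \\
&\le d \sum_{Q\in\mathcal{P}_\nu}\ \sum_{V\in\mathcal{W}_\nu(Q)} \exp_d[-\nu D(Q \times V\|P_L)]\,|\mathcal{W}_\nu(Q)| \max_{V'\in\mathcal{W}_\nu(Q):\ H(V'|Q)\le H(V|Q)} \exp_d\big[-\nu|k - kR - H(V'|Q)|^+\big] \\
&= d \sum_{Q\in\mathcal{P}_\nu}\ \sum_{V\in\mathcal{W}_\nu(Q)} \exp_d[-\nu D(Q \times V\|P_L)]\,|\mathcal{W}_\nu(Q)|\,\exp_d\big[-\nu|k - kR - H(V|Q)|^+\big] \\
&\le d \sum_{Q\in\mathcal{P}_\nu} |\mathcal{W}_\nu(Q)|^2 \max_{V\in\mathcal{W}_\nu(Q)} \exp_d\big[-\nu D(Q \times V\|P_L) - \nu|k - kR - H(V|Q)|^+\big] \\
&\le d\,|\mathcal{P}_\nu|\,W_\nu^2 \max_{Q\in\mathcal{P}_\nu,\ V\in\mathcal{W}_\nu(Q)} \exp_d\big[-\nu D(Q \times V\|P_L) - \nu|k - kR - H(V|Q)|^+\big] \\
&\le d\,|\mathcal{P}_\nu|\,W_\nu^2 \exp_d\Big\{-\nu \min_{P'\in\mathcal{P}_\nu(F^{n-k}\times F^{2k})} \big[D(P'\|P_L) + |k - kR - H(P'|\overline{P'})|^+\big]\Big\} \\
&\le d\,|\mathcal{P}_\nu|\,W_\nu^2 \exp_d\Big\{-\nu \min_{P'} \big[D(P'\|P_L) + |k - kR - H(P'|\overline{P'})|^+\big]\Big\} \\
&= d\,|\mathcal{P}_\nu|\,W_\nu^2 \exp_d[-\nu E_{n,k}(R, P_L)].
\end{aligned}$$

This bound on $1 - \bar{F}$ is trivially true for $R > 1$. Note that $|\mathcal{P}_\nu|$ and $W_\nu$ are polynomial in $\nu$. We see the bound in the theorem upon putting $R = R'n/k$.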

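IV. Appendix D: Proof of Lemma 4

Let $\rho = (\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}$ and assume $\mathcal{C} = \mathcal{C}^{(0)}$ without loss of generality as in Section III-B. To prove the lemma, we will show two equalities

$$S(\mathcal{A}^{\otimes n}(\rho)) = H(\overline{P_L}) + k \tag{53}$$

and

$$S\big([\mathcal{I} \otimes \mathcal{A}^{\otimes n}](|\Psi\rangle\langle\Psi|)\big) = H(\overline{P_L}) + H(P_L|\overline{P_L}), \tag{54}$$

where $|\Psi\rangle$ is a purification of $\rho$, which will establish the statement.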

The interpretation of errors N^, x G F^", in terms of the basis {gi, hi, . . . ,gn, hn} in Section |11I-B 



is useful to see ( ^31) and (|54D . Namely, we trace the action of an error N^., which can be viewed as 
XuZu'N^'^^ N^''^^ as discussed in Section [111-B| . Equation ( p3D holds because (dimC)~^nc is conveyed to 
(dimC*^^^)~^n^{s) by A^'-*-* with probability Pl{s), where 'y{x)2i = Si for 1 < i < n — k and C^^^ has been 
given in ( pO|) or (^Sf) , the subspaces C^'^\ s G F*^"*^, are mutually orthogonal, and the action of 
is similar to that of a tensor product of Pauli matrices or Weyl unitaries, which leaves the operator
11^(3) unchanged. 

Similar reasoning results in (15^) . In this case, we trace the action of errors / (S> A^^ on the state 
|\E')(\E'|, where 

1^) = ^ J2 1^1'- --'^fc) ® |0,...,0,6i,...,6fe) 

(fei,...,fefc)GFfe 

is a purification of (dimC)~"'^nc. The action of A^(*)A^W (in fact, / ® A^^'^^A^M in this case) is similar to 
the previous case. To see how XuZu' acts on the states, we use the next fundamental lemma on CP 
linear maps. 

Lemma 8: Let H' be a Hilbert space with an orthonormal basis {\ro), ■ ■ ■ , |rx-i)}. A linear 
map Ai : L(H') L(H') is completely positive if and only if [1 7V1](|$)($|) is positive, where I is the 
identity map on L(H'), and 

Moreover, if we represent [I ® as 



K 

0<i<K 



[I®A^](|<l>)($|)~i,^m,.m: 



t 

xey 

where ~ indicates that the right-hand side is the matrix of the operator on the left-hand side with 
respect to the basis {|rj) (g) ,,)g{o,...,ft:-i}2, i-e., 

T ~ itij,ki){i,j,k,i)e{o,...,K-i}^ < ^ T = ^ tij^kii\ri) ® \rj))i{rk\ ® (nl), 

(ij,fc,/)g{0,...,ii'-l}4 

and rearrange the elements of 

n^x = ("^x,0,0; • • • ; fTT'x,0,K-l, • • • , fTT'x,K-l,0^ • • • ; 'm'x,K -1,K -l)'^ ^ 

into the matrix form 

(fT^xfifi ■ ■ ■ '^x,K-l,0 \ 
^x,0,K-l ■ ■ ■ ^x,K-l,K-l/ 

then we obtain an operator-sum representation of M.: ~ {Mx}xeyi where is the matrix of 
with respect to the basis x Ey. O 

Due to Lemma |^, the matrix of [I ® ^®"](|\E') (\E'|) with respect to the basis that consists of 



(si, . . . , sn-k) e F"-^ (61, 6,), (&;,... , b',) e F^ 

is block diagonal [where the basis elements are arranged in a lexicographic order on (si, . . . , s„_fc; 
bi, . . . ,bk;b[, . . . , 6'^)], and owing to Lemma ||, which we apply putting K = and 

1$) = ci-'^/2 \bi,...,bk) ® \Si,...,Sn-k,bl,...,bk) 

to each block, von Neumann entropy of the block with label s = (si, . . . , Sn-k), after normalization, 
equals Shannon entropy of Y conditional on X = s, where the pair of random variables (X, Y) is drawn 
according to Pl- Thus, we have (^4]), completing the proof. 
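V. Appendix E: Proof of Lemma 5

Barnum et al. (p. 4162) have shown the inequality

$$S(\rho) \le I_c(\rho, \mathcal{A}^{\otimes n}) + 2 + 4\big[1 - F_e(\rho, \mathcal{R}_n\mathcal{A}^{\otimes n})\big]n, \tag{55}$$

which holds for any state $\rho$ in $\mathsf{L}(\mathsf{H}^{\otimes n})$, channel $\mathcal{A}$ and TPCP linear map $\mathcal{R}_n : \mathsf{L}(\mathsf{H}^{\otimes n}) \to \mathsf{L}(\mathsf{H}^{\otimes n})$, where $F_e$ denotes the entanglement fidelity. Also it is known that $F(\mathcal{C}, \mathcal{R}_n\mathcal{A}^{\otimes n}) \ge 1 - \eta$ implies $F_e((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{R}_n\mathcal{A}^{\otimes n}) \ge 1 - (3/2)\eta$ (p. 1324, Theorem 2 of the cited reference). Putting $\rho = (\dim\mathcal{C}_n)^{-1}\Pi_{\mathcal{C}_n}$ in (55) and assuming $\mathcal{C}_n \in T_n$ and $F(\mathcal{C}_n, \mathcal{R}_n\mathcal{A}^{\otimes n}) \to 1$ as $n$ goes to infinity, we have

$$\limsup_{n\to\infty}\frac{\log_d\dim\mathcal{C}_n}{n} \le \limsup_{n\to\infty}\sup_{\mathcal{C}\in T_n}\frac{I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big)}{n},$$

and hence, the lemma.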

VI. Appendix F: Proof of Lemma 6

First, note that in the proof of Theorem 1 (Section V and Appendix C), we have assumed that the operator basis $N = \{N_u\}_{u\in\mathcal{X}}$ employed for code design is exactly the same as that used in the representation $\{\sqrt{P(u)}N_u\}_{u\in\mathcal{X}}$ of the Pauli channel, and that the proof has actually shown

$$\mathsf{C}(\mathcal{A}|\{S_n(N)\}) \ge \sup_{n\ge 1}\max_{\mathcal{C}\in S_n(N)}\frac{I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big)}{n} \tag{56}$$

for any Pauli or $N$-channel $\mathcal{A} \sim \{\sqrt{P(u)}N_u\}_{u\in\mathcal{X}}$.

Put

$$c_n = \max_{\mathcal{C}\in S_n(N)} I_c\big((\dim\mathcal{C})^{-1}\Pi_{\mathcal{C}}, \mathcal{A}^{\otimes n}\big) = \max_{L}\big[k - H(P_L|\overline{P_L})\big],\quad n = 1, 2, \dots,$$

where the second equality is due to Lemma 4. From (56) and Lemma 5 with $T_n = S_n(N)$, we will obtain the lemma if we show that the limit of $c_n/n$ exists and

$$\lim_{n\to\infty} c_n/n = \sup_{n\ge 1} c_n/n. \tag{57}$$

To do this, we will show

$$c_{n+n'} \ge c_n + c_{n'}. \tag{58}$$

The fact that (58), together with the boundedness of $c_n/n$, implies (57) for a general sequence of real numbers $\{c_n\}$ has often been used in (quantum) information theory.
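The implication from (58) to (57) can be seen at work on a toy example. The Python sketch below uses the hypothetical sequence $c_n = n - \sqrt{n}$, which has nothing to do with coherent information: it is superadditive because $\sqrt{n+m} \le \sqrt{n} + \sqrt{m}$, and $c_n/n = 1 - 1/\sqrt{n}$ increases to its supremum 1.

```python
import math

def c(n):
    """A toy superadditive sequence: c(n + m) >= c(n) + c(m)."""
    return n - math.sqrt(n)

def is_superadditive(seq, upto):
    """Check seq(n + m) >= seq(n) + seq(m) for 1 <= n, m <= upto (with rounding slack)."""
    return all(seq(n + m) >= seq(n) + seq(m) - 1e-12
               for n in range(1, upto + 1) for m in range(1, upto + 1))

# c(n)/n approaches sup_n c(n)/n = 1 as n grows.
ratios = [c(n) / n for n in (1, 10, 100, 10_000, 1_000_000)]
```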

Now let $L \subseteq F^{2n}$ and $L' \subseteq F^{2n'}$ with $\dim L = n-k$ and $\dim L' = n'-k'$ achieve the maxima of $k - H(P_L|\overline{P_L})$ and $k' - H(P_{L'}|\overline{P_{L'}})$, respectively. Recall that $H(P_L|\overline{P_L})$ and $H(P_{L'}|\overline{P_{L'}})$ are determined from coset arrays of $L$ and $L'$ defined in Section III-C. All we have to show is the existence of a self-orthogonal subspace $L'' \subseteq F^{2(n+n')}$ with $\dim L'' = n + n' - k''$ such that

$$k'' - H(P_{L''}|\overline{P_{L''}}) \ge k - H(P_L|\overline{P_L}) + k' - H(P_{L'}|\overline{P_{L'}}).$$

We can see that

$$LL' = \{xy \in F^{2(n+n')} \mid x \in L,\ y \in L'\}$$

with $\dim LL' = n + n' - (k + k')$, where $xy$ denotes the vector obtained by pasting $x$ and $y$ together, is such a code as follows. We consider probability arrays of $L$ and $L'$. Then, it is easy to see that the $d^{n+n'-(k+k')} \times d^{2(k+k')}$ array whose $(ss', uu')$-entry is

$$P_{LL'}(ss', uu') = P_L(s, u)P_{L'}(s', u'),\quad (s, u) \in F^{n-k} \times F^{2k},\ (s', u') \in F^{n'-k'} \times F^{2k'},$$

is a probability array of $LL'$. From this array, we have

$$\begin{aligned}
k + k' - H(P_{LL'}|\overline{P_{LL'}}) &= k + k' - H(P_{LL'}) + H(\overline{P_{LL'}}) \\
&= k + k' - H(P_L) - H(P_{L'}) + H(\overline{P_L}) + H(\overline{P_{L'}}) \\
&= k - H(P_L|\overline{P_L}) + k' - H(P_{L'}|\overline{P_{L'}}).
\end{aligned}$$

[To see these equalities, introduce random variables $X, Y, X', Y'$ such that $\Pr\{X = s, Y = u, X' = s', Y' = u'\} = P_L(s, u)P_{L'}(s', u')$.] Hence, we have (58) and consequently the lemma.
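The additivity of the conditional entropy for a product array, which is the only property used in the chain of equalities above, can be checked numerically. The following Python sketch uses two small made-up arrays in place of actual probability arrays of codes; the function names are hypothetical.

```python
import math
from itertools import product

def cond_entropy(joint, base=2):
    """H(Y | X) for a dict {(x, y): probability}."""
    marg = {}
    for (x, _), q in joint.items():
        marg[x] = marg.get(x, 0.0) + q
    return -sum(q * math.log(q / marg[x], base)
                for (x, _), q in joint.items() if q > 0)

def product_array(P, Q):
    """Array with entries P(s, u) Q(s', u'), indexed by ((s, s'), (u, u'))."""
    return {((s, s2), (u, u2)): p * q
            for ((s, u), p), ((s2, u2), q) in product(P.items(), Q.items())}

# Two made-up arrays (rows: first index, columns: second index).
P = {(0, 0): 0.5, (0, 1): 0.2, (1, 0): 0.1, (1, 1): 0.2}
Q = {(0, 0): 0.7, (0, 1): 0.1, (1, 1): 0.2}
gap = cond_entropy(product_array(P, Q)) - cond_entropy(P) - cond_entropy(Q)
```

Up to floating-point rounding, `gap` is zero, in agreement with the equalities above.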